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Abstract 

Many functional logic languages are based on narrowing, a unification-based goal-solving 
mechanism which subsumes the reduction mechanism of functional languages and the 
resolution principle of logic languages. Needed narrowing is an optimal evaluation strategy 
which constitutes the basis of modern (narrowing-based) lazy functional logic languages. In 
this work, we present the fundamentals of partial evaluation in such languages. We provide 
correctness results for partial evaluation based on needed narrowing and show that the 
nice properties of this strategy are essential for the specialization process. In particular, 
the structure of the original program is preserved by partial evaluation and, thus, the 
same evaluation strategy can be applied for the execution of specialized programs. This 
is in contrast to other partial evaluation schemes for lazy functional logic programs which 
may change the program structure in a negative way. Recent proposals for the partial 
evaluation of declarative multi-paradigm programs use (some form of) needed narrowing 
to perform computations at partial evaluation time. Therefore, our results constitute the 
basis for the correctness of such partial evaluators. 

KEYWORDS: partial evaluation, functional logic programming, needed narrowing 



1 Introduction 

Functional logic languages combine the operational principles of the most important 
declarative programming paradigms, namely functional and logic programming. Ef- 
ficient demand-driven functional computations are amalgamated with the flexible 

* A preliminary short version of this paper appeared in the Proceedings of the International 
Conference on Functional Programming (ICFP'99), pp. 273-283, Paris, 1999. This paper has 
been accepted for publication in the Journal of Theory and Practice of Logic Programming. In 
contrast to the journal version, this paper contains the detailed proofs of the results presented 
in this paper. This work has been partially supported by CICYT TIC2001-2705-C03-01, by 
MCYT under grant HA2001-0059, and by the German Research Council (DFG) under grant 
Ha 2457/1-2. 
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use of logical variables providing for function inversion and search for solutions. The 
operational semantics of such languages is usually based on narrowing, a general- 
ization of term rewriting which combines reduction and variable instantiation. A 
narrowing step instantiates variables of an expression and applies a reduction step 
to a redex (reducible expression) of the instantiated expression. The instantiation 
of variables is usually computed by unifying a subterm of the entire expression with 
the left-hand side of some rule. 

Example 1 

Consider the following rules which define the less-or-equal predicate on natural 
numbers which arc represented by terms built from data constructors and s (note 
that variable names always start with an uppercase letter): 

< N — > true 
s(M) ^ -> false 
s(M) ^ s(N) -» M < N 

The goal s(X) ^ Y can be solved (i.e., reduced to true) by instantiating Y to s(Yl) 
to apply the third rule followed by the instantiation of X to to apply the first rule: 

s(X) < Y ^{y^s(yi)} X < Yl ^>{ X ^o} true 

Narrowing provides completeness in the sense of logic programming (computation of 
all solutions) as well as functional programming (computation of values) . Since sim- 
ple narrowing can have a huge search space, great effort has been made to develop 
sophisticated narrowing strategies without losing completeness; see QHanus 1994(1 
for a survey. To avoid unnecessary computations and to provide computations with 
infinite data structures as well as a demand-driven generation of the search space, 
most recent work has advocated lazy narrowing strategies, e.g., ( |Antoy et al. 2000| 
IGiovannetti et al. 1991l|Loogen et al. 1993||Moreno-Navarro and Rodriguez- Artalejo 1992] ) . 
Many lazy evaluation strategies are based on the notions of demanded or needed 
computations. The following example informally explains the difference between 
these two notions: 

Example 2 

Consider the rules for in Example ^ together with the following rules defining 
the addition on natural numbers: 

+ N -> N 
s(M)+N -> s(M + N) 

The initial term is X ^ X + X. The evaluation of subterm X + X is demanded by the 
second and third rules for , since these rules cannot be applied to X < X + X 
until the subterm X + X is reduced to a term rooted by a data constructor symbol. 
However, evaluating this subterm is not needed since, if we instantiate X to 0, we 
directly obtain true by using the first rule for 

On the other hand, if the initial term is X + (0 + 0), the evaluation of + is 
needed to compute its value whereas it is not demanded by any rule for "+'.' 
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Needed narrowing ( |Antoy et al. 2000*1 ) is based on the idea of evaluating only sub- 
terms which are needed in order to compute a result. For instance, in a term like 
h ^ h, it is always necessary to evaluate t\ (to some head normal form, i.e., cither 
a variable or a constructor-rooted term) since all three rules in Example ^ have 
left-hand sides whose first argument is not a variable. On the other hand, the eval- 
uation of fa is only needed if t\ is of the form s( - • •). Thus, if t\ is a free variable, 
needed narrowing instantiates it to a constructor, here or s(- • •). Then, depending 
on this instantiation, either the first rule is applied or the second argument t% is 
evaluated. Needed narrowing is currently the best narrowing strategy for first-order 
functional logic programs due to its optimality properties w.r.t. the length of deriva- 
tions and the number of computed solutions jAntoy et al. 2 000 ) . Informally speak- 
ing, needed narrowing derivations are the shortest possible narrowing derivations if 
common subtcrms are shared (as it is usually done in implementations of functional 
languages), and the set of all solutions computed by needed narrowing is minimal 
since needed narrowing computes only independent solutions (see also Theorem Q 
below). Furthermore, it can be efficiently implemented by pattern matching and 
unification l|Hanus 19951 |Loogen et al. 1993| ). For instance, the operational seman- 



tics of the declarative multi-paradigm language Curry (Hanus (ed.) 20031 is based 
on needed narrowing. Needed narrowing has also been extended to higher-order 
functions and A-terms as data structures and proved optimal w.r.t. the indepen- 
dence of computed solutions planus and Prehofer 1999). 

Partial evaluation (PE) is a semantics-preserving performance optimization tech- 
nique for computer programs which consists of the specialization of the program 
w.r.t. parts of its input. PE has been widely applied in the fields of term rewriting 
systems j |Bellegarde 1995|lEondorf 1988l|Dershowitz and Reddy 1993||Lafave and Gallagher 1997| ), 
functional programming ( |Consel and Danvy 1993|IJones et al. 19 93). and logic pro- 
gramming ^Gallagher 1993| |Lloyd and Shepherdson 1991||De Schreye et al. 1999| ). 
Although the objectives are similar, the general methods are often different due to 
the distinct underlying models and the different perspectives ( |Alpuente et al. 1998a| . 
This separation has the negative consequence of duplicated work since developments 
are not shared and many similarities are overlooked. A unified treatment can bring 
the different methodologies closer and lays the ground for new insights in all three 

fields < |Alpuente et al. 1998a||Alpuente et al. 1998b||Gluck and S0rensen 1994IIPettorossi and Proietti 1996al 
|S0rensen et al. 19961 ). 

In order to perform reductions at specialization time, online partial evaluators 
normally include an interpreter ( |Consel and Danvy 1 993). This implies that the 
power of the transformation is highly influenced by the properties of the evaluation 
strategy in the underlying interpreter. Narrowing-driven PE ( |Alpuente et al. 1 998a 
lAlbert and Vidal 2002jl is the first generic algorithm for the specialization of func- 
tional logic programs. The method is parametric w.r.t. the narrowing strategy which 
is used for the automatic construction of the search trees. The method is formalized 



within the theoretical framework established by Lloyd and Shepherdson (19911 for 
the PE of logic programs (also known as partial deduction) , although a number of 
concepts have been generalized to deal with the functional component of the lan- 
guage (e.g., nested function calls in expressions, different evaluation strategies, etc). 
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This approach has better opportunities for optimization thanks to the functional 
dimension (e.g., by the inclusion of deterministic evaluation steps). Also, since uni- 
fication is embedded into narrowing, it is able to automatically propagate syntactic 
information on the partial input (term structure) and not only constant values, 



similar to partial deduction. Using the terminology of Gluck and S0rensen (19961, 
narrowing-driven PE is able to produce both polyvariant and polygenetic special- 
izations, i.e., it can produce different specializations for the same function defi- 
nition and can also combine distinct original function definitions into a compre- 
hensive specialized function. This means that narrowing-driven PE has the same 
potential for specialization as positive supercompilation of functional programs 
( Sor ensen et al. 19961 and conjunctive partial deduction of logic programs \De Schreye et al.~ 999 ) 
more detailed comparisons can be found in ( |Alpuente et al. 1998a|| Alpuente et al. 1998b| 
lAlhert and Virial 20021. 

The main contribution of this work is the proof of the basic computational prop- 
erties of PE based on needed narrowing. The most recent approaches for the PE of 
multi-paradigm functional logic languages l|Albert et al. 19991 lAlbert et al. 20021 
lAlbert et al. 2003}) use (a form of) needed narrowing to perform computations at 
PE time (see also Section EJ. Therefore, our results constitute the basis for the 
correctness of such partial evaluators. To be more precise, we provide the following 
results for PE based on needed narrowing: 

• We prove the strong correctness of the PE scheme: the answers computed 
by needed narrowing in the original and the partially evaluated programs 
coincide. 

• We establish the relation between PE based on needed narrowing and PE 
based on a different lazy evaluation mechanism — which is the basis of previous 
partial evaluators ( |Alpuente et al. 1997). We formally prove the superiority 
of needed narrowing to perform partial computations. In particular, we prove 
that the structure of the original program is preserved by PE based on needed 
narrowing and, thus, the same optimal evaluation strategy can be applied for 
the execution of specialized programs. This is in contrast to previous PE 
schemes ( |Alpuente et al. 1997| ) for lazy functional logic programs which may 
change the program structure in a negative way. 

• We show that specialized programs preserve deterministic evaluations, i.e., if 
the source program can evaluate a goal without any choice, then the partially 
evaluated program does just the same. This is important from an implemen- 
tation point of view and it is not obtained by PE based on other operational 
models, like lazy narrowing. 

Providing experimental evidence of the practical advantages of using needed nar- 
rowing to perform PE is outside the scope of this paper. We refer, e.g., to ( fAlbert et al. 2002 ) 
where this topic has been extensively addressed for a practical partial evaluator 
based on the foundations presented in this paper. 

The structure of the paper is as follows. After some basic definitions in the next 
section, we recall in Section the formal definition of inductively sequential pro- 
grams and needed narrowing. Section ^ recalls the lazy narrowing strategy and 
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relates it to needed narrowing. The definition of partial evaluation based on needed 
narrowing is provided in Section [3] together with results about the structure of 
specialized programs and the (strong) correctness of the transformation. Section [B] 
outlines several recent extensions of PE based on needed narrowing. Finally, Sec- 
tion concludes. Proofs of selected results can be found in an appendix. 

2 Preliminaries 

Term rewriting systems (TRSs) provide an adequate computational model for func- 
tional languages which allow the definition of functions by means of patterns (e.g., 
Haskell, Hope or Miranda). Within this framework, the class of inductively se- 
quential programs, which we consider in this paper, has been defined, studied, and 
used for the implementation of programming languages which provide for optimal 
computations both in functional and functional logic programming ( |Antoy 1992| 
|Antoy et al. 2000| IHanus 19971 IHanus et al. 19981 |Loogen et al. 1993D . Inductively 
sequential programs can be thought of as constructor-based TRSs with discriminat- 
ing left-hand sides, i.e., typical functional programs where at most one rule is used to 
reduce a particular subterm (without variables). Thus, in the remainder of the paper 
we follow the standard framework of term rewriting ( Dcrshowit z and .Touannaud 1990) 
for developing our results. 

We consider a (many-sorted) signature £ partitioned into a set C of constructors 
and a set T of (defined) functions or operations. We write c/n £ C and / ' /n G T 
for n-ary constructor and operation symbols, respectively. There is at least one sort 
Bool containing the Boolean constructors true and false. Given a set of variables X, 
the set of terms and constructor terms are denoted by T(C U T ', X) and T(C, X), 
respectively. The set of variables occurring in a term t is denoted by Var(t). A 
term t is ground if Var(t) = 0. A term is linear if it does not contain multiple 
occurrences of one variable. We write o^" for the sequence of objects Oi, . .. , o n . 

A pattern is a term of the form f(d n ) where f/n G T and d\, . . . , d n G T(C, X). A 
term is operation-rooted if it has an operation symbol at the root, root(t) denotes 
the symbol at the root of the term t. A position p in a term t is represented by a 
sequence of natural numbers (A denotes the empty sequence, i.e., the root position). 
They are used to address the nodes of a term viewed as a tree (Dewey notation) . 
For instance, if t = f(t\, . . . , t n ), positions 1, .. . , n refer to arguments ti, . .., t n 
respectively; thus, given a position pi of a subterm of U, position i ■ pi denotes the 
corresponding subterm of t. Positions are ordered by the prefix ordering: u < v, if 
there exists w such that u-w — v. Given a term t,Vos(t) and MW os(t) denote the 
set of positions and the set of non- variable positions of t, respectively. t\ p denotes 
the subterm of t at position p, and t[s] p denotes the result of replacing the subterm 
t\ p by the term s (see l|Dershowitz and .Touannaud 1990J1 for details). 

We denote by {xi <— > ti,...,x n ^ t n } the substitution a with o~(xi) = U for 
i = 1, . . . , n (with Xi ^ Xj if i ^ j), and a(x) — x for all other variables x. The 
set Dom(a) — {x G X \ <r(x) ^ x} is called the domain of a. A substitution 
a is constructor (ground constructor), if u(x) is constructor (ground constructor) 
for all x G Vom(a). The identity substitution is denoted by id. Substitutions are 



6 



M. Alpuente et al. 



extended to morphisms on terms by a(f(t n )) = f(a(t n )) for every term f(t n ). Given 
a substitution 9 and a set of variables V C X, we denote by 8^ v the substitution 
obtained from 8 by restricting its domain to V. We write (9 = a)\V] if 6^v = &{v, 
and (0 < a) [ V] denotes the existence of a substitution 7 such that (7 o 6 = a) [ V] . 

Term t' is an instance of t if there is a substitution cr with t' = a(t). This implies 
a (relative generality) subsumption ordering on terms which is defined by t < t' 
iff t 1 is an instance of t. A unifier of two terms s and £ is a substitution cr with 
a(s) = cr(t). The unifier a is most general if (cr < c')^] for each other unifier cr'. 

A rewrite rule is an ordered pair (I, r), written I — > r, with l,r 6 T(C U 
Z ^ <Y and Var(r) C Var(Z). A set of rewrite rules is called a term rewriting system 
(TRS). The terms I and r are called the left-hand side (lhs) and the right-hand side 
(rhs) of the rule, respectively. A TRS 72. is left-linear if £ is linear for all I — ► r € 72.. 
A TRS is constructor-based (CB) if each lhs I is a pattern. Two (possibly renamed) 
rules I — > r and ?' — » r' overlap, if there is a non- variable position p € MW 'os(Z) 
and a most general unifier cr such that cr(/| p ) = cr(Z') . A left-linear TRS without 
overlapping rules is called orthogonal. In the remainder of this paper, a functional 
logic program is a finite left-linear CB-TRS. Conditions in program rules are treated 
by using the predefined functions and, if _then_else, case_of which are reduced 
by standard defining rules ( |Moreno-Navarro and Rodriguez^ Artalejo 19921 ). 

A rewrite step is an application of a rewrite rule to a term, i.e., t —> p ,r s if there 
is a position p in t, a rewrite rule R of the form I — ► r and a substitution cr with 
£|p = cr(/) and s — t[<j(r)] p (p and R will often be omitted in the notation of a 
rewrite step). The instantiated lhs a (I) is called a redex. Vos-ji(t) denotes the set 
of redex positions of the term t in the TRS 1Z. (— >*) denotes the transitive 
(reflexive and transitive) closure of — If t s, we say that t is rewritten to s. A 
term t is root-stable (often called a head-normal form) if it cannot be rewritten to 
a redex. A constructor root-stable term is either a variable or a constructor-rooted 
term, i.e., a term rooted by a constructor symbol. A term t is called irreducible or 
in normal form if there is no term s with t — > s. 

In order to evaluate terms containing variables, narrowing non-deterministically 
instantiates its variables such that a rewrite step is possible — usually by comput- 
ing most general unifiers between a subterm and some lhs (Hanus 1994), but this 
requirement is relaxed in needed narrowing steps in order to obtain an optimal 
evaluation strategy ( |Antoy et al. 2000| ). Formally, t ~~>p,R. a t' is a narrowing step 
if p is a non-variable position in t and a(t) — > p .r t'. We denote by to t n a 
sequence of narrowing steps to ~~* ai ■ ■ ■ ^ct„ t n with a = cr„ o • ■ ■ o o\ (if n = then 
cr = id). Since we are interested in computing values (constructor terms) as well 
as answers (substitutions), we say that the narrowing derivation t ^->* c computes 
the result c with answer a if c is a constructor term. The evaluation to ground 
constructor terms is the most common semantics of functional (logic) languages. In 
lazy functional (logic) languages, the equality predicate ~ used in some examples 
is defined as the strict equality on terms (note that we do not require terminating 
rewrite systems and, thus, reflexivity is not desired), i.e., the equation t\ « is 
satisfied if and only if t\ and are reducible to the same ground constructor term. 
Furthermore, a substitution cr is a solution for an equation t\ « if &(ti) ~ 
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is satisfied. The strict equality can be defined as a binary Boolean function by the 
following set of orthogonal rewrite rules: 

c w c — > true c/OeC 
c(Xi I ...,X n )Ric(Y 1 ,...,Y n ) -> (Xi wYO A...A(X n «Y n ) c/neC,n>0 
true A X — > X 

Thus, we do not treat strict equality in any special way and it is sufficient to 
consider it as a Boolean function. We say that a is a computed answer substitu- 
tion for an equation e if there is a narrowing derivation e ^* true. More details 
about strict equality can be found in flAntoy et al. 2000| IGiovannetti et al. 19911 
|Moreno-Navarro and Rodriguez- Artalejo 1992| ). 

As in logic programming, narrowing derivations can be represented by a (possibly 
infinite) finitely branching tree. Formally, given a program 1Z and an operation- 
rooted term t, a narrowing tree for t in 1Z is a tree satisfying the following conditions: 
(a) each node of the tree is a term, (b) the root node is t, and (c) if s is a node 
of the tree then, for each narrowing step s _r )<t s', the node has a child s' and 
the corresponding arc in the tree is labeled with (p,R, o~). A failing leaf contains 
a term which is not a constructor term and which cannot be further narrowed. 
Following (Llo yd and Shepherdson 1991| ), in this work we adopt the convention that 
a derivation can be incomplete (thus, a branch can be failed, incomplete, successful, 
or infinite). 



3 Needed Narrowing 

Since functional logic languages are intended to extend (pure) logic languages, com- 
pleteness of the operational semantics is an important issue. Similarly to logic 
programming, completeness means the ability to compute representatives of all 
solutions for one or more equations (this will be formalized in Theorem Q . Nar- 
rowing, as defined in the previous section, is complete but highly (don't-know) 
non-deterministic: if t is a term, we have to apply at all non-variable subterms 
all possible rules with all possible substitutions in order to compute all solutions. 
Clearly, this would be too inefficient for a realistic functional logic language. Thus, 
a challenge in the design of functional logic languages is the definition of a "good" 
narrowing strategy, i.e., a restriction on the narrowing steps issuing from a given 
term t, without losing completeness, planus 1994JI contains a survey of various 
attempts to define reasonable narrowing strategies. 

Needed narrowing ( |Antoy et al. 2000| ) is currently the best known narrowing 
strategy due to its optimality properties (see the discussion in Section ^and Theo- 
rem^. Needed narrowing is defined on inductively sequential programs, a class of 
CB-TRSs where the left-hand sides do not overlap (in particular, they are not unifi- 
able). To provide a definition of this class of programs and the needed narrowing 
strategy, we introduce definitional trees ( |Antoy 1992| ). Here we use the definition 
of QAntoy 1997D which is more appropriate for our purposes. 

A definitional tree of a finite set S of linear patterns is a non-empty set V of 
linear patterns partially ordered by subsumption having the following properties: 
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< Y — > true 



s(X') < -> false s(X') < s(Y') -> X' Y' 



Fig. 1. Definitional tree for the function 

Root property: V has a minimum element (that we denote as pattern(V)), also 

called the pattern of the definitional tree. 
Leaves property: The maximal elements of V, called the leaves of the definitional 

tree, are the elements of S. Non-maximal elements are also called branch nodes. 
Parent property: If tt € V, tt ^ pattern^), there exists a unique tt' £ V, called the 

parent of tt (and tt is called a child of 7r'), such that tt' < tt and there is no other 

pattern tt" e T(C U J 7 , X) with tt' < tt" < tt. 
Induction property: Given tt £ V\S, there is a position o in tt with tt\ G X (called 

the inductive position), and constructors ci/fci, . . . , c n /k n € C with c, 7^ for 

i 7^ j, such that, for all tt\, ...,ir n which have the parent tt, tt{ — Tr[ci(xj^)] 

(where Xk~ are new distinct variables) for all 1 < i < n. 1 

If TZ is an orthogonal TRS and f /n a defined function, we call V a definitional tree 
of f if pattern^) = f(x^) for distinct variables and the leaves of V are all (and 
only) variants of the left-hand sides of the rules in TZ defining / (i.e., rules / — > r 
such that root(t) = f,f e JT). Due to the orthogonality of 7\L, we can assign a unique 
rule defining / to each leaf. A defined function is called inductively sequential if 
it has a definitional tree. A rewrite system TZ is called inductively sequential if all 
its defined functions are inductively sequential. An inductively sequential TRS can 
be viewed as a set of definitional trees, each defining a function symbol. There can 
be more than one definitional tree for an inductively sequential function. In the 
following, we assume that there is a fixed definitional tree for each defined function. 

It is often convenient and simplifies understanding to provide a graphic repre- 
sentation of definitional trees, where each inner node is marked with a pattern, the 
inductive position in branch nodes is surrounded by a box, and the leaves contain 
the corresponding rules. For instance, the definitional tree of the function "^5" in 
Example H is illustrated in Figure ^ 

The following auxiliary proposition shows that functions defined by a single rule 
are always inductively sequential. 



Proposition 1 



1 There might be more than one potential inductive position when constructing a definitional 
tree. In this case one can select any of them since the results about needed narrowing do not 
depend on the selected definitional tree. 
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If f(t n ) is a linear pattern, then there exists a definitional tree for the set {/(£«)} 
with pattern f(x^). 

Proof 

By induction on the number of constructor symbols occurring in t, where each 
constructor symbol is introduced in a child of a branch node and each branch node 
has only one child. □ 

For the definition of needed narrowing, we assume that t is an operation-rooted 
term and V is a definitional tree with pattern(V) = tt such that tt < t. We define 
a function A from terms and definitional trees to sets of tuples (position, rule, 
substitution) as the least set satisfying the following properties. We consider two 
cases for V: 2 

1. If 7r is a leaf, i.e., V = {7r}, and 7r — > r is a variant of a rewrite rule, then 

\(t,V) = {(A,Tr^r,id)}- 

2. If 7r is a branch node, consider the inductive position o of 7r and a child 
ni = ir{a(x^)] E V. Let V{ = {n' E V \ 7Tj < n'} be the definitional tree 
where all patterns are instances of ni. Then we consider the following cases 
for the subterm t\ : 

P,R,<tot) ]£t\ = x€X,T = {xi->Ci(x£)}, 
and (p,R,a) E X(T(t),V l ); 

X(t,V) 3 { {p,R, a o id) if t\ a = c 2 (<^) and (p,R,a) £ X(t,Vi); 

(o ■ p, R, a o id) if t\ = f(t n ), f E J 7 , and (p, R, a) E X(t\ ,T") 
where V' is a definitional tree for /. 

Informally speaking, needed narrowing applies a rule, if the definitional tree does 
not require further pattern matching (case 1), or checks the subterm correspond- 
ing to the inductive position of the branch node (case 2): if it is a variable, it is 
instantiated to the constructor of a child; if it is already a constructor, we proceed 
with the corresponding child (note that we do not actually need substitution id 
but we include it to provide a normalized representation of a needed narrowing 
step, see below); if it is a function, we evaluate it by recursively applying needed 
narrowing. Thus, the strategy differs from typical lazy functional languages only in 
the instantiation of free variables. 

Note that, in each recursive step during the computation of A, we compose the 
current substitution with the local substitution of this step (which can be the iden- 
tity). Thus, each needed narrowing step can be represented as (p, R, <fk ° ■ ■ ■ ° 
where each tpj is either the identity or the replacement of a single variable computed 
in each recursive step (see the following proposition). This is also called the canon- 
ical representation of a needed narrowing step. As in proof procedures for logic 



2 This description of a needed narrowing step is slightly different from jAntoy et al. 20001 but it 
results in the same needed narrowing steps. 
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programming, we assume that the definitional trees always contain new variables 
if they are used in a narrowing step. This implies that all computed substitutions 
are idempotent (we will implicitly assume this property in the following). 

To compute needed narrowing steps for an operation- rooted term t, we take the 
definitional tree V for the root of t and compute X(t,V). Then, for all (p,R,a) G 
X(t,V), t "^p,R,a t' is a needed narrowing step. We call this step deterministic if 
X(t,V) contains exactly one element. 

Example 3 

Consider the rules in Example El Then the function A computes the following set 
for the initial term X ^ X + X: 

{(A, 0<N^ true, {X h-> 0}), (2, s(M) + N -> s(M + N), {X ^ s(M)})} 

This corresponds to the following narrowing steps: 

X X + X ^{xh^o} true 

Xs^X + X ^ {Xms (m)} s(M) sC s(M + s(M)) 

In the following we state some interesting properties of needed narrowing which are 
useful for our later results. The first proposition shows that each substitution in a 
needed narrowing step instantiates only variables occurring in the initial term. 

Proposition 2 

If (p, R,<pk ° ■ • • ° <£>i) G X(t, V) is a needed narrowing step, then, for i = 1, . . . , k, 
ipi = id or (pi — {x i y c(x^)} (where are pairwise different variables) with 
x G Var((p,i-i o • ■ ■ o ipi(t)). 

Proof 

By induction on k. □ 

The next lemma shows that for different narrowing steps (computing different sub- 
stitutions) there is always a variable which is instantiated to different constructors: 

Lemma 1 

Let t be an operation-rooted term, V a definitional tree with pattern{V) < t and 
(p,R,<Pk°--'°<Pi),(p',R',<p' k ,°---°<Pi) G A(t,P), k < k'. Then, for alH G {1, . . .,k}, 

• either ifi o • • • o (pi = (pt o • • • o <p^, or 

• there exists some j < i with 

1. ipj o • • • o ipi = ip'- o • • • o ip' 1: and 

2. ifij+i — {x i— > c(- ■ •)} and <^j+i — { x l— * c '(' ' ')} w hh c ^ c'. 
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By induction on k (the number of recursive steps performed by A to compute 
(p,R, <Pk ° • " ° <Pi))-- 

k = 1: Then V = {ir} and A(t , 7^) = {(A, R, id)}. Thus, the proposition trivially 
holds. 

k > 1: Then tt — pattern(V) is a branch node and there is an inductive position 
o of 7r such that all children of it have the form Hi = 7r[ci(x^)] £ V . Let 
Pi = {tt' E V \ TTi < 7r'} be the definitional tree where all patterns are instances 
of 7Tj, for i = 1, . ..,n. We prove the induction step by a case distinction on the 
form of the subterm t\ : 

t\ = x £ X: Then ip x = {x ^ Ci(x^)} and (p, R, tp k o • • • o ip 2 ) £ \(<pi(t),Vi) for 
some i. If Lp' x = {x i— > c(- • •)} with c ^ Cj, then the proposition directly holds. 
Otherwise, if ipi = ip[, the proposition follows from the induction hypothesis 
applied to (j>, R, cp k o ■ ■ ■ o tp 2 ), (p 1 , R',ip' k , o ■■■ o cp' 2 ) g \((pi(t),Vi). 

t\o — Ci(t n ): Then ipi — id and (p,R,ipk o ■ ■ ■ o ip 2 ) 6 X(t,Vi). Clearly (p[ = id 
by definition of A. Hence the proposition follows from the induction hypothesis 
applied to (p, R, <p k o ■ • • o ip 2 ), (p', R', <p' k ,o---o ip' 2 ) e \{t, Vi). 

t\ = f(t n ): Then ipi — id and (p, R, (p k ° • ■ • ° f 2 ) £ A(f| ,'P') where P' is a 
definitional tree for /. By definition of A, (p[ = id. Then the proposition follows 
from the induction hypothesis applied to (p, R, ip k ° " " • ° ^2), {p' , R', o ■ • • o 
y/ a ) G A(<| ,7"). 

□ 

For inductively sequential programs, needed narrowing is sound and complete w.r.t. 
strict equality when we consider constructor substitutions as solutions (note that 
constructor substitutions are sufficient in practice since a broader class of solutions 
would contain unevaluated or undefined expressions for the considered programs). 
Moreover, needed narrowing does not compute redundant solutions. These prop- 
erties are formalized as follows, where we say that two substitutions a and a' are 
independent (on a set of variables V C X) iff there is some x £ V such that a(x) 
and cr'(x) are not unifiable. 3 

Theorem 1 \Antoy et al. 2000$ 

Let 1Z be an inductively sequential program and e an equation. 

1. (Soundness) If e true is a needed narrowing derivation, then a is a solution 
for e. 

2. (Completeness) For each constructor substitution a that is a solution of e, 
there exists a needed narrowing derivation e true with a' < o~[Var(e)]. 

3. (Minimality) If e ^* an( i e ^ rMe are ^ w0 distinct needed narrowing 
derivations, then a and a 1 are independent on Var(e). 



3 Actually, flAntoy et al. 2000) prove a stronger property (disjointness of solutions) but this is 
not necessary here. 
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An important advantage of functional logic languages in comparison to pure logic 
languages is their improved operational behavior by avoiding non-deterministic 
computation steps. One reason for that is a demand-driven computation strat- 
egy which can avoid the evaluation of potential non-deterministic expressions. For 
instance, consider the rules in Examples and [3] and the term ^ X + X. Needed 
narrowing evaluates this term by one deterministic step to true. In an equivalent 
logic program, this nested term must be flattened into a conjunction of two pred- 
icate calls, like +(X,X, Z) A ^(0,Z), which causes a non-deterministic computation 
due to the predicate call +(X,X,Z). 4 Another reason for the improved operational 
behavior of functional logic languages is the ability of particular evaluation strate- 
gies (like needed narrowing or parallel narrowing ( |Antoy et al. 19971 )) to evaluate 
ground terms in a completely deterministic way, which is important to ensure an 
efficient implementation of purely functional evaluations. This property, which is 
obvious by the definition of needed narrowing, is formally stated in the following 
proposition. For this purpose, we call a term t deterministically evaluable (w.r.t. 
needed narrowing) if each step in a narrowing derivation issuing from t is determi- 
nistic. A term t deterministically normalizes to a constructor term c (w.r.t. needed 
narrowing) if t is deterministically evaluable and there is a needed narrowing deriva- 
tion t c (i.e., c is the normal form of t). 

Proposition 3 

Let 1Z be an inductively sequential program and t be a term. 

1 . If i c is a needed narrowing derivation, then t deterministically normal- 
izes to c. 

2. If t is ground, then t is deterministically evaluable. 

4 Lazy Narrowing and Uniform Programs 

One of the main objectives of this work is to clarify the relation between the defini- 
tion of a PE scheme based on needed narrowing and a previous PE method based 
on lazy narrowing | |Alpuente et al.~ 997). In order to show the improvements ob- 
tained by using needed narrowing to perform partial computations, we first provide 
a brief review of the lazy narrowing strategy in this section. 

Lazy narrowing reduces expressions at outermost narrowable positions. Narrow- 
ing at inner positions is performed only if it is demanded (by the pattern in the 
lhs of some rule). In the following, we specify a lazy narrowing strategy which is 
similar to ( |Moreno-Navarro and Rodriguez- Artalejo 1 992). 

The following definitions are necessary for our formalization of lazy narrowing. 
A linear unification problem is a pair of terms: S = (f (d n ) , / (£„)} , where f(d n ) and 
f(t n ) do not share variables, and f(d n ) is a linear pattern. Linear unification LU(<5) 
can either succeed, fail or suspend, delivering (Succ, a), (Fail, 0) or (Demand, P), 

4 Such non-deterministic computations could be avoided using Prolog systems with coroutining 
which allow the suspension of some non-deterministic computations, but then we are faced with 
the problem of floundering and incompleteness. 
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respectively, where P is the set of demanded positions which require further eval- 
uation; details can be found in | |Alpuente et al. 1997| ). 

We define the lazy narrowing strategy in the following definition. Roughly speak- 
ing, the set-valued function Xi aZ y(t) returns the set of triples (p,R,o~) such that p 
is a demanded position of t which can be narrowed by the rule R with substitution 
a (where a is a most general unifier of t\ p and the left-hand side of R). We assume 
the rules of 1Z to be numbered with R\ , . . . , R r , 



Definition 1 [lazy narrowing strategy) 

Xlazy(t) = U™=lM^ A > fc ) 

\-{t,p,k) = if rootQk) = root(t\ p ) then 

{ (Succ, cr): {(p,R k ,a)} 
(Fail, 0) : 
(Demand, P) : \J qeP |jr=i A_(t, p ■ q, k) 

else 

where Rk = (4 — ► ffe) is a (renamed apart) rule of 1Z. 



Example 4 

Consider the rules for and "+" in Examples ^ an d El Then lazy narrowing 
evaluates the term X $C X + X by applying a narrowing step at the top (with the 
first rule for ) or by applying a narrowing step to the second argument X + X 
since this is demanded by the second and third rule for . Thus, there are three 
lazy narrowing steps: 

X ^ X + X ^{x^o} true 

X^X + X ^{x^o} 0^0 

X^X + X ^ {x ^s(M)} s(M) < s(M + s(M)) 

Note that the second lazy narrowing step is in some sense superfluous since it also 
yields the final value true with the same binding as the first step. The avoidance 
of such superfluous steps by using needed narrowing will have a positive impact on 
the PE process, as we will see later. 

In orthogonal programs, lazy narrowing is complete w.r.t. strict equality and con- 
structor substitutions: 

Proposition 4 ^Moreno- Navarro and R odriguez- Artalej o 1992$ 
Let 1Z be an orthogonal program, e an equation, and a a constructor substitution 
that is a solution for e. Then there is a lazy narrowing derivation e true such 
that a' < a[Var(e)}. 

Thus, lazy narrowing is complete for a larger class of programs than needed nar- 
rowing (since inductively sequential programs are always orthogonal), but it may 
have a worse behavior than needed narrowing (see Example 0J. Nevertheless, the 
idea of needed narrowing can also be extended to almost orthogonal programs 
QAntoy e~ al. 1997), but then the optimality properties are lost. There exists a class 
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of programs where the superfluous steps of lazy narrowing are avoided, since lazy 
narrowing and needed narrowing coincide on this class. These are the uniform pro- 
grams IjZartmann 1997]) which are inductively sequential programs where at most 
one constructor occurs in the left-hand side of each rule. A program is uniform if 
each function / is defined by one rule / (T^) -> r or the left-hand side of every rule 
Ri defining / is left-linear and has the form / (~Xk , Ci (y^~) , z^) , where the construc- 
tors Ci are distinct in different rules. Note that uniform programs are orthogonal. 
In the latter case, an evaluation of a call to / demands its (k + l)-th argument. A 
different definition of uniform programs can be found in l|Kuchen et al. 1990). 

There is a simple mapping IA from inductively sequential into uniform programs 
which is based on flattening nested patterns, see ( Zartman n 1997JI . For instance, if 
1Z is the program in Example ^ then U(TV) consists of the rules 

^ N — > true M ^' — > false 

s(M) ^ N -> M <' N M^'s(Nl) -> M<N1 

where is a new function symbol. 

The following theorem states a correspondence between needed narrowing deriva- 
tions using the original program and lazy narrowing derivations in the transformed 
uniform program. For a more detailed comparison between needed narrowing and 
lazy narrowing, we refer to \ Alpuente et al. 2003| ). 

Theorem 2 \Zartmann 1997[ 

Let 1Z be an inductively sequential program, U(1Z) the transformed uniform pro- 
gram, and t an operation-rooted term. Then there exists a needed narrowing deriva- 
tion t ~»* s w.r.t. 1Z to a constructor root-stable form s iff there exists a lazy 
narrowing derivation t s w.r.t. U(7Z). 

5 Partial Evaluation with Needed Narrowing 

In this section, we introduce the basic notions of PE in (lazy) functional logic 
programming. Then, we analyze the fundamental properties of PE based on needed 
narrowing and establish the relation with PE based on lazy narrowing. 

Partial evaluation is a semantics-based program optimization technique which 
has been investigated within different programming paradigms and applied to a 
wide variety of languages. The first PE framework for functional logic programs 
has been defined by \ Alpuente et al. 199 8a). In this framework, narrowing (the 
standard operational semantics of integrated languages) is used to drive the PE 
process; similarly to partial deduction, specialized program rules are constructed 
from narrowing derivations using the notion of resultant. In the following, s ^>jr t 
denotes a narrowing derivation with at least one narrowing step. 

Definition 2 [resultant) 

Let K be a TRS and s be a term. Given a narrowing derivation s ^+ t, its 
associated resultant is the rewrite rule er(s) — > t. 

Note that, whenever the specialized call s is not a linear pattern, the left-hand 
sides of resultants may not be linear patterns either and hence resultants may not 
be program rules: 
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Example 5 

Consider the following inductively sequential program: 

double (X) -> X + X 
+ N -> N 
s(M) + N -> s(M + N) 

Given the term double (W) + W and the following needed narrowing derivation (the 
selected redex is underlined at each narrowing step): 

double (W) + W ~> ld ( W + W ) + W ^ {Wms (m)} s(M + s(M)) + s(M) 

we compute the associated resultant: 

double(s(M)) + s(M) -> s(M + s(M)) + s(M) 

This resultant is not a legal program rule since its left-hand side contains nested 
defined function symbols ("+" and "double") as well as multiple occurrences of 
the same variable. 

In order to produce legal program rules, we introduce a post-processing of renaming 
which not only eliminates redundant structures but also obtains independent spe- 
cializations in the sense of ( |Lloyd an d Shcphcrds on 1991| ). Furthermore, it is also 
necessary for the correctness of the PE transformation. Roughly speaking, indepen- 
dence ensures that the different specializations for the same function definition are 
correctly distinguished, which is crucial for polyvariant specialization. 

The (pre-)partial evaluation of a term s is obtained by constructing a (possibly 
incomplete) narrowing tree for s and then extracting the specialized definitions (the 
resultants) from the non-failing, root-to-leaf paths of the tree. 

Definition 3 {pre-partial evaluation) 

Let TZ be a TRS and s a term. Let T be a finite (possibly incomplete) narrowing tree 
for s in TZ such that no constructor root-stable term in the tree has been narrowed. 
Let t n be the terms in the non-failing leaves of T. Then, the set of resultants 
{o~i(s) — > ti | i = 1, . . . , n} for the narrowing sequences {s U \ i — 1, . . . , n} is 
called a pre-partial evaluation of s in TZ. 

The pre-partial evaluation of a set of terms S in TZ is defined as the union of the 
pre-partial evaluations for the terms of S in TZ. 

Example 6 

Consider the following function append to concatenate two lists (here we use "nil" 
and ":" as constructors of lists): 

append(nil, Y s ) — > Y s 

append(X : X S ,Y S ) — > X : append(X s , Y s ) 

together with the set of calls S — {append(append(X s , Y s ), Z s ), append(X s , Y s )}. 
Given the needed narrowing trees of Figure the associated pre-partial evaluation 
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append(append(X s , Y s ), Z s ) 






\^{X B i * X' : X'J 


append(Y s , Z s ) 


append(X' : append(Xs, Y s ), Z s ) 


append(Y s , Z s ) 


X' : appeJd(append(X' s ,Y s ),Z s ) 






^--^{Ys i — > Y' : Yg } 


Zs 


Y' : append^, Z s ) 



Fig. 2. Needed Narrowing trees for append(append(X s , Y s ), Z s ) and append(X s , Y s ). 

of S in 1Z is as follows: 

append(append(nil,Y s ),Z s ) — *■ append(Y s , Z s ) 
append(append(X : X s , Y s ), Z s ) — ► X : append(append(X s , Y s ), Z s ) 
append(nil, Z s ) — ► Z s 
append(Y : Y s , Z s ) — > Y : append(Y s , Z s ) 

The following example illustrates that the restriction not to evaluate terms in con- 
structor root-stable form cannot be dropped. 

Example 7 

Consider the following program 1Z: 

f(0) -> 
g(X) - s(f(X)) 
h(s(X)) - s(0) 

together with the set of calls S = {g(X),h(X)}. Given the needed narrowing deriva- 
tions: 

g00~>i„ s(f(X)) ^ {!h0} s(0) 
h(X) ->{x^ s (y)} s(0) 

a pre-partial evaluation of S in 1Z is the following program 7V: 

g(0) - s(0) 
h(s(X)) - 8(0) 

Now, the equation h(g(s(0))) « X has the following successful needed narrowing 
derivation in 7?.: 

h( g(s(0)) ) « X ^ id h(s(f(s(0)))) « X -> id s(0) w X ^{ Xms (o) } true 
whereas it fails in the specialized program 1Z' . 
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The problem shown in the above example is due to the backpropagation of bindings 
to the left-hand sides of resultants: within a lazy context, the instantiation of the 
left-hand sides of resultants with bindings which come from the evaluation of terms 
in constructor root-stable form may incorrectly restrict the domain of functions 
(e.g., function "g" above). 

A recursive closedness condition, which guarantees that each call which might 
occur during the execution of the resulting program is covered by some program 
rule, is formalized by inductively checking that the different calls in the rules are 
sufficiently covered by the specialized functions. For instance, a function call like 
s(X) + Y cannot be considered closed w.r.t. the set of calls {0 + Y, s(0) + Y}. 

Informally, a term t rooted by a defined function symbol is closed w.r.t. a set of 
calls S , if it is an instance of a term of S and the terms in the matching substitution 
are recursively closed by S. 

Definition 4 (closedness) 

Let S be a finite set of terms. We say that a term t is 5-closed if closed(S, t) holds, 
where the predicate closed is defined inductively as follows: 



where C* = (CU{«,A}). 

We say that a set of terms T is 5-closed, written closed(S, T), if closed(S,t) 
holds for all t € T, and we say that a TRS 1Z is S'-closed if closed (S ,TZ ca iis) holds. 
Here we denote by 7Z ca u s the set of the right-hand sides of the rules in 1Z. 

For instance, the pre-partial evaluation of Example [5] is closed w.r.t. the set of 
partially evaluated calls {append(append(X s , Y s ), Z s ), append(X s , Y s )}. 

According to the (non-deterministic) definition above, an expression rooted by a 
"primitive" function symbol, such as a conjunction t\ A £2 or an equation t\ w £2, 
can be proved closed w.r.t. S either by checking that t\ and £2 are 5-closed or by 
testing whether the conjunction (equation) is an instance of a call in S (followed 
by an inductive test of the subterms). This is useful when we are not interested in 
specializing complex expressions (like conjunctions or equations) but we still want 
to run them after specialization. Note that this is safe since we consider that the 
rules which define the primitive functions and "A" are automatically added to 
each program by existing programming environments, hence calls to these symbols 
are steadily covered in the specialized program. A general technique for dealing 
with primitive symbols which deterministically splits terms before testing them for 
closedness can be found in (A lbert et al. 1 998). 

In general, given a call s and a program 1Z, there exists an infinite number of dif- 
ferent pre-partial evaluations of s in 1Z. A fixed rule for generating resultants called 
an unfolding rule is assumed, which determines the expressions to be narrowed (by 
using a fixed narrowing strategy) and which decides how to stop the construction of 
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narrowing trees; see IjAlbert et al. 19981 |Alpuente et al. 1998a| lAlbert et al. 2002|1 
for the definition of concrete unfolding rules. 

In the following, we denote by pre-NN-PE and pre-LN-PE the sets of resultants 
computed for S in 1Z by considering an unfolding rule which constructs finite needed 
and lazy narrowing trees, respectively. We will use the acronyms NN-PE and LN- 
PE for the renamed rules which will result from the corresponding post-processing 
of renaming. The idea behind this transformation is that, for any call (which is 
closed w.r.t. the considered set of calls), the answers computed for this call in the 
original program and the answers computed for the renamed call in the specialized, 
renamed program do coincide. In particular, in order to define a partial evaluator 
based on needed narrowing and to ensure that the resulting program is inductively 
sequential whenever the source program is, we have to make sure that the set 
of specialized terms (after renaming) contains only linear patterns with distinct 
root symbols. This can be ensured by introducing a new function symbol for each 
specialized term and then replacing each call in the specialized program by a call 
to the corresponding renamed function. In particular, the left-hand sides of the 
specialized program (which are constructor instances of the specialized terms) are 
replaced by instances of the corresponding new linear patterns through renaming. 

Definition 5 (independent renaming) 

An independent renaming p for a set of terms S is a mapping from terms to terms 
defined as follows: for s G S , p(s) — f s (x^), where are the distinct variables in s 
in the left-to-right ordering and f s is a new function symbol, which does not occur 
in 1Z or S and is different from the root symbol of any other p(s'), with s' £ S and 
s' ^ s. We also denote by p(S) the set 5" = {p(s) | s e S}. 

Example 8 

Consider the set S = {append(append(X s , Y s ), Z s ), append(X s , Y s )}. The following 
mapping: 

p= {append(X s , Y s ) h-> app(X s ,Y s ), append(append(X s , Y s ), Z s ) h-> dapp(X s , Y s , Z s ) } 
is an independent renaming for S . 

While independent renamings suffice to rename the left-hand sides of resultants 
(since they are constructor instances of the specialized calls), the right-hand sides 
are renamed by means of the auxiliary function ren p , which recursively replaces 
each call in the given expression by a call to the corresponding renamed function 
(according to p). 

Definition 6 (renaming function) 

Let S be a finite set of terms and p an independent renaming of S . Given a term 
t, the non-deterministic function ren p is defined as follows: 

t ifteX 



ren p (t) 



c(ren p (t n )) if t = c(£„), c 6 C*, and n > 
8'(p(s)) if 36, 3s e S such that t = 6(s) and 

9' = {x ren p (9(x)) \ x £ Vom(6)} 
t otherwise 
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where C* = (C U {»,A}). 

Similarly to the test for closedness, an equation s w t can be (non-deterministically) 
renamed either by independently renaming s and t or by replacing the considered 
equation by a call to the corresponding new, renamed function (when the equation 
is an instance of some specialized call in S). Note also that the renaming function 
is a total function: if an operation-rooted term t is not an instance of any term in 
S (which can occur if t is not 5-closed), the function ren p (t) returns t itself (i.e., 
term t is not renamed). 

The notion of partial evaluation can be formally defined as follows. 

Definition 7 {partial evaluation) 

Let 1Z be a TRS, S a finite set of terms and 1Z' a pre-partial evaluation of 1Z w.r.t. 
S . Let p be an independent renaming of S. We define the partial evaluation TZ" of 
1Z w.r.t. S (under p) as follows: 



We now illustrate these definitions with an example. 
Example 9 

Let us consider the program append and the set of terms S of Example together 
with the independent renaming p of Example[S| A partial evaluation 1Z' of 1Z w.r.t. 
S (under p) is: 



Note that, for a given renaming p, the renamed form of a program 1Z may depend 
on the strategy which selects the term from p(S) which is used to rename a given 
call t in 1Z (e.g., append(append(X s , Y s ), Z s )), since there may exist, in general, more 
than one term in S that covers the call t. Some potential specialization might be 
lost due to an inconvenient choice. Appropriate heuristics which are able to produce 
the best potential specialization have been introduced in the implementation of the 
partial evaluator described in l|Albert et al. 20 02 ). 

The correctness of LN-PE is stated in IjAlbert et al. 19981 |Alpuente et al. 1997| ). 
It is important to clarify that, even if the methodology for narrowing-driven PE in 
( |Alpuente et al. 1998a| ) is parametric w.r.t. the narrowing strategy, this framework 
only ensures that: 

• partially evaluated programs are closed w.r.t. the set of partially evaluated 
calls — which is necessary, although does not suffice, to guarantee the com- 
pleteness of the transformation — , and 

• the PE process always terminates. 



k"=U(#)) 



ren p (r) | 9(s) — > r 6 1Z' is a resultant for s in TZ} 



dapp(nil,Y s ,Z s ) 
dapp(X : X S ,Y S ,Z S ) 
app(nil, Y s ) 
app(X : X S ,Y S ) 



app(Y s ,Z s ) 

X : dapp(X s ,Y s ,Z s ) 

Y s 

X : app(X s ,Y s ) 



In particular, the correctness of the PE transformation cannot be proved in a way 
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independent of the narrowing strategy. These results are by their nature highly de- 
pendent on the concrete strategy which is considered, as it is known that different 
narrowing strategies have quite different semantic properties. In fact, the use of a 
lazy evaluation strategy imposes some additional restrictions on PE, such as the 
use of "strict equality" , the requirement not to evaluate terms in constructor root- 
stable form during PE, or the need for an additional post-processing of renaming. 
All these additional requirements are essential to ensure the correctness of the trans- 
formation and were not present in the original framework of ( |Alpuente et al. 1 998a, 
|Alpuente et al. 1998b| ), where correctness is only proved for an eager narrowing 
strategy. Therefore, it was necessary to develop a new theory for PE based on lazy 
narrowing as a separate work (Alpue nte et al. 1997| ), which is now overcome by the 
needed narrowing methodology formalized in this article. 

The following lemma shows that any PE based on needed narrowing can also be 
obtained (but possibly with more steps) by PE of the transformed uniform pro- 
gram based on lazy narrowing. This means that, in some sense, the specializations 
computed by a partial evaluator based on needed narrowing cannot be worse than 
the specializations computed by a partial evaluator based on lazy narrowing. On 
the other hand, we will also show later that there are cases where a LN-PE is worse 
than a NN-PE for the same original program. 

Lemma 2 

Let TZ be an inductively sequential program, TZ U = U(TZ) the corresponding uniform 
program, and S a finite set of operation-rooted terms. If TZ' is an NN-PE of S in 
TZ, then TZ' is also an LN-PE of S in TZ U . 

Proof 

Since the final renaming applied in the partial evaluation of a program does not 
depend on the narrowing strategy used during the pre-partial evaluation, it suffices 
to show that each resultant w.r.t. needed narrowing in TZ corresponds to a resultant 
w.r.t. lazy narrowing in TZ U . Due to the definition of a resultant, each rule in the 
pre-partial evaluation w.r.t. needed narrowing in TZ has the form 

a(t) -> s 

where t 6 S and t ^+ s is a needed narrowing derivation w.r.t. TZ. By Theorem|21 
there exists a lazy narrowing derivation t s w.r.t. TZ U which has the same 
answer and result (note that Theorem[2]states this property only for derivations into 
constructor-rooted terms, but it also holds in the direction used here for arbitrary 
needed narrowing derivations since each needed narrowing step corresponds to a 
sequence of lazy narrowing steps w.r.t. the transformed uniform programs, which 
can be seen by the proof of this theorem). Thus, a(t) — > s is a resultant of this lazy 
narrowing derivation w.r.t. TZ U . □ 

The following theorem states an important property of PE based on needed nar- 
rowing: if the input program is inductively sequential, then the partially evaluated 
program is also inductively sequential and, thus, we can also apply the needed 
narrowing strategy to evaluate calls in the specialized program. The proof of this 



Specialization of Functional Logic Programs Based on Needed Narrowing 21 



theorem can be found in |Appcndix A| An extension of this theorem — although it 
relies on the result below regarding the unfolding transformation — in the context 
of a more general fold/unfold framework can be found in | |Alpuente et al. 2004| ). 

Theorem 3 

Let TZ be an inductively sequential program and S a finite set of operation-rooted 
terms. Then each NN-PE of TZ w.r.t. S is inductively sequential. 

The following example reveals that, when we consider lazy narrowing, the LN-PE 
of a uniform program w.r.t. a linear pattern may not be uniform. 

Example 10 

Let TZ be the following uniform program: 

f(X,b) - g(X) 
g(a) -> a 

and t = f (X,Y) and p(t) = f2(X,Y). Then a LN-PE W of (in TZ (under p) is 

f2(a,b) —> a 
which is not uniform. 

The residual program TZ' in the example above is inductively sequential. This raises 
the question whether the LN-PE of a uniform program is always inductively sequen- 
tial. Corollary ^ will positively answer this question. 

Corollary 1 

Let TZ be a uniform program and 5* a finite set of operation-rooted terms. If TZ' is 
a LN-PE of S in TZ, then TZ' is inductively sequential. 

Proof 

Since a uniform program is inductively sequential and lazy narrowing steps w.r.t. 
uniform programs are also needed narrowing steps (cf. proof of Theorem |5J| , the 
proposition is a direct consequence of Theorem [3J □ 

The uniformity condition in Corollary^cannot be weakened to inductive sequential- 
ly when LN-PEs are considered, as demonstrated by the following counterexample. 

Example 11 

Let TZ be the following inductively sequential program: 



f(a,a, a) - 


-> b 


h(a,b,X) 


b 


f(b,b,X) - 


-» b 


h(e,X,k) - 


-» b 


g(a,b,X) - 


-» b 


i(X,c,d) - 


■4 b 


g(X,c,d) - 


-> b 


i(e,X,k) - 


-> b 



Let t = f (g(X,Y, Z),h(X,Y, Z), i(X,Y,Z)) <E S and p be a renaming such that p(t) = 
f3(X,Y, Z). Then, every LN-PE TZ' of S in TZ (considering depth-2 lazy narrowing 
trees to construct the resultants) contains the rules: 

f3(a,b,X) -> ■•• 
f3(e,X,k) -> ■•• 
f3(X, c,d) -> ••• 

and thus TZ' is not inductively sequential. 
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One of the main factors affecting the quality of a PE is the treatment of choice 
points ( |Leuschel and Bruynooghe 2002| [Gallaghe r 1993| ). The following examples 
illustrate the different way in which NN-PE and LN-PE "compile-in" choice points 
during unfolding, which is crucial to performance since a poor control choice dur- 
ing the construction of the computation trees can inadvertently introduce extra 
computation into a program. 

Example 12 

Consider again the rules of Examplc|3and the input term X ^ X + Y. The computed 
LN-PE is as follows: 

leq2(0,N) — > true 
leq2(0,N') — > true 
leq2(s(M),N) -> leq2(M,N) 

where the renamed initial term is leq2(X, Y). The redundancy of lazy narrowing 
has the effect that the first two rules of the specialized program are identical (up 
to renaming). In contrast, a better specialization — without generating redundant 
rules — is obtained by PE based on needed narrowing, since the NN-PE consists of 
the following rules: 

leq2(0,N) — > true 
leq2(s(M),N) -> leq2(M,N) 

Note that a call-by- value partial evaluator based on innermost narrowing ( |Alpuente et al. 1998a| 
has an even worse behavior in this example since it does not specialize the program 
at all. 

In the example above, the superfluous rule in the LN-PE can be avoided by removing 
duplicates in a post-processing step. The next example shows that this is not always 
possible. 

Example 13 

Lazy evaluation strategies are necessary if one wants to deal with infinite data 
structures and possibly non-terminating function calls. The following orthogonal 
program makes use of these features: 

f(0,0) -> s(f(0,0)) g(0) -> g(0) 

f(s(N),X) -> s(f(N,X)) h(s(X)) -> 

The specialization is initiated with the term h(f (X, g(Y))). Note that this term 
reduces to if X is bound to s(- • •), and it does not terminate if X is bound to 
due to the nonterminating evaluation of the second argument. The NN-PE of this 
program perfectly reflects this behavior (the renamed initial term is h2(X, Y)): 

hO ->■ hO h2(0,0) -> hO 

h2(s(X),Y) -> 

On the other hand, the LN-PE of this program has a worse structure: 

hl(X) -> hl(X) h2(X,0) -> hl(X) 

hl(s(X)) -> h2(s(X),Y) -> 

h2(s(X),0) -> 
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The program specialized by LN-PE in the example above is not inductively se- 
quential (nor orthogonal), in contrast to the original one. This does not only mean 
that lazy and needed narrowing are not applicable to the specialized program but 
also that the specialized program has a worse termination behavior than the orig- 
inal one. For instance, consider the term h(f (s(0), g(0))). The evaluation of this 
term has a finite derivation tree w.r.t. lazy narrowing as well as needed narrow- 
ing in the original program. However, the renamed term h2(s(0),0) has a finite 
derivation tree w.r.t. the NN-PE but an infinite derivation tree w.r.t. the LN-PE 
(using lazy narrowing); the infinite branch is caused by the application of the rules 
h2(X,0) hl(X) and hl(X) -> hl(X). 

This last example also shows that LN-PE can destroy the advantages of determi- 
nistic reduction of functional logic programs, which is not possible using NN-PE. 
This is ensured by the following theorem, which guarantees that a term which 
is deterministically normalizablc w.r.t. the original program cannot cause a non- 
deterministic evaluation w.r.t. the specialized program obtained by NN-PE. 

Theorem 4 

Let 1Z be an inductively sequential program, S a finite set of operation-rooted terms, 
p an independent renaming of S , and e an equation. Let 1Z' be a NN-PE of 1Z w.r.t. 
S (under p) such that lZ'U{e'} is S"-closed, where e' = ren p (e) and S' — p(S). If e 
deterministically normalizes to true w.r.t. 7Z, then e' deterministically normalizes 
to true w.r.t. 1Z' . 

Proof 

Since e deterministically normalizes to true w.r.t. 7Z, there is a needed narrowing 
derivation e true in 7Z. By Theorem[S](see below), there is a needed narrowing 
derivation e' true in 1Z' with a = id[Var(e)]. This implies a = id by definition 
of needed narrowing. Therefore, e' deterministically normalizes to true w.r.t. TZ' by 
Proposition |3| □ 

This property of specialized programs is desirable and important from an im- 
plementation point of view, since the implementation of non-deterministic steps 
is an expensive operation in logic-oriented languages. Moreover, additional non- 
determinism in the specialized programs can result in additional infinite derivations, 
as shown in Example El This might have the effect that solutions are no longer 
computable in a sequential implementation based on backtracking. Essentially, de- 
terministic computations are preserved thanks to the use of needed narrowing over 
inductively sequential programs to perform partial computations. For instance, con- 
sider the function "leq" of Example ^ together with the simple function "f oo": 

foo(0) -> 

Given a function call of the form X ^ foo(Y), many narrowing strategies (e.g., lazy 
narrowing) have two ways to proceed: either by reducing the call to function 
using the first rule 

X ^ f oo(Y) -^{Xh^o} true 
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and by reducing the call to function "f oo" (which is demanded by the second and 
third rules of 

X < foo(Y) -^>{y m o} X < 

Thus, their associated resultants are as follows: 

< f oo(Y) — > true 
X < f oo(0) -> X < 

Now, given a call of the form ^ foo(Z), both resultants are applicable but the 
second one is clearly redundant. Actually, the second resultant is only meaningful 
to evaluate those calls whose first argument is of the form s(- • •), since only the 
second and third rules of demanded the evaluation of call foo(0) that gave 
rise to this resultant. The advantage of using needed narrowing is that it applies 
some additional bindings so that this information is made explicit in the computed 
resultants, e.g., the resultants obtained by needed narrowing are 

< foo(Y) — > true 
s(Z) ^ foo(0) -> s(Z) < 

thus avoiding the creation of additional non-determinism. This property is some- 
how related to the notion of perfect splits used in IjAbramov and Chick 20001 
lAbramov and Gliick 20021 IGliick and Klimov 1993J1 to guarantee that no compu- 
tations are neither lost nor added when constructing — by driving IjTurchin 19 86 ) , a 
symbolic execution mechanism which shares many similarities with lazy narrowing — 
the perfect process trees of (positive) supercompilation jS0rensen et al. 1996). 

Note that there is no counterpart of this property in the partial deduction of 
logic programs, since the considered execution mechanism (some variant of SLD- 
resolution) never demands — in a don 't-know non-deterministic way — the evaluation 
of different atoms of the same goal. 

Finally, we state the strong correctness of NN-PE, which amounts to the com- 
putational equivalence between the original and the specialized programs (i.e., the 
fact that the two programs compute exactly the same answers) for the considered 
goals. The proof of this theorem can be found in | Appendix B| 

Theorem 5 (strong correctness) 

Let 1Z be an inductively sequential program. Let e be an equation, V 3 Var(e) a 
finite set of variables, S a finite set of operation-rooted terms, and p an independent 
renaming of 5". Let 1Z! be a NN-PE of Tl w.r.t. 5" (under p) such that K'U{e'} is S'- 
closed, where e' — ren p (e) and S' — p(S). Then, e ^* true is a needed narrowing 
derivation for e in 7Z iff there exists a needed narrowing derivation e' ~>*/ true in 
1Z' such that (a' = cr)[V] (up to renaming). 

It is worthwhile to note that the correctness of NN-PE cannot be derived from the 
correctness of LN-PE | |Alpuente et al. 1997| ) , since the preservation of inductive se- 
quentially (cf. TheoremlSJ is a crucial point in our proof scheme, and this property 
does not hold for LN-PE. 

On the other hand, it is well-known that partial evaluation can be defined within 
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the fold/unfold framework (Pettoro ssi and Proietti 1996b|l by using only unfolding 
and a restricted form of folding. Hence the correctness of NN-PE could be de- 
rived from the correctness of a fold/unfold framework for the transformation of 
functional logic programs based on needed narrowing. However, the only frame- 
work of this kind in the literature is ( |Alpuente et al. 1999| |Alpuente et al. 2004| ) 
and their proofs of correctness — regarding the unfolding transformation — rely on 
the results in this article. The precise relation between partial evaluation and the 
fold/unfold transformation — for lazy functional logic programs — can be found in 
HAlpuente et al. 2000| ). 



6 Further Developments 

In the previous sections, we introduced the theoretical basis for PE in the context 
of lazy functional logic programming. Since the preliminary publication of these re- 
sults, several extensions as well as concrete partial evaluators have been developed. 
In this section, we review some of these subsequent developments. 

The computational model of modern declarative multi-paradigm languages, which 
integrate the most important features of functional, logic and concurrent program- 
ming, is based on a combination of two different operational principles: needed 
narrowing and residuation planus 1997|) . The residuation principle is based on the 
idea of delaying function calls until they are sufficiently instantiated for a determi- 
nistic evaluation by rewriting. The particular mechanism (narrowing or residuation) 
is specified by evaluation annotations: deterministic functions are annotated as rigid 
(which forces a delayed evaluation by rewriting), while non-deterministic functions 
are annotated as flexible (which enables narrowing steps). 

Although NN-PE is originally formulated for functional logic languages based 
uniquely on needed narrowing, it is still possible to adapt it to the use of distinct 
operational mechanisms. In fact, NN-PE has been already adjusted to perform 
partial computations using the combined operational semantics described above 
HAlbert 20011 lAlbert et al. 1999|l . 

On the other hand, NN-PE has also been extended IjAlbert et al. 2002}) in order to 
make it viable for defining partial evaluators for practical multi-paradigm functional 



logic languages like Curry ( Hanus (ed.) 2003| ) or Toy ( |L6pez-Fraguas and Sanchez- Hernandez 1999| ). 



When one considers a practical language, several extensions have to be considered, 
e.g., higher-order functions, concurrent constraints, calls to external functions, etc. 
In order to deal with these additional features, the underlying operational calculus 
becomes usually more complex. As we mentioned earlier, an on-line partial eval- 
uator normally includes an interpreter of the language ( |Consel and Danvy 1 993). 
Then, as the operational semantics becomes more elaborated, the associated PE 
techniques become (more powerful but) also increasingly more complex. To avoid 
this problem, an approach successfully tested in other contexts (Bondorf 1989 
IGliick and Klimov 19931 |Nemytykh et al. 1996| ) is to consider the PE of programs 
written in a maximally simplified programming language. 



Hanus and Prehofer (1999 ) have introduced a Eat representation for functional 
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logic programs in which definitional trees are embedded in the rewrite rules by 
means of case expressions: 

Example H 

Function of Example ^ can be written in the flat representation as follows: 

X < Y = case X of { — > true; 

s(Xi) — > case Y of { — » false; 

b(Yi) -► Xi < Yi } } 

Two nice properties of the flat representation are that it provides more explicit 
control — hence the associated calculus is simpler than needed narrowing — and source 
programs can be automatically translated to the new representation. Moreover, it 
constitutes the basis of a recent proposal for an intermediate language, FlatCurry, 
used during the compilation of Curry programs ( |Antoy and Hanus 2000||Antoy et al. 2001| ). 
A new PE scheme IjAlbert 20011 1 Albert et al. 2002jl has been designed by consid- 
ering such a Rat representation for functional logic programs. 

However, the use of the standard semantics for flat programs — the LNT calcu- 
lus (Hanu s~and Prehofer 1999| . which is equivalent to needed narrowing — at PE 
time does not avoid the backpropagation of bindings when evaluating terms in 
constructor root-stable form, which can be problematic within a lazy context (see 
Example EJ. In order to overcome this problem, a residualizing version of the stan- 
dard semantics is introduced: the RLNT calculus l| Albert 20011 1 Albert et al. 2 003) . 
Finally, since modern lazy functional logic languages can be automatically trans- 
lated into this flat representation — which still contains all the necessary information 
about programs — the resulting technique is widely applicable. 

All these results laid the ground for the development of a partial evaluation 
tool for Curry programs, which has been distributed with the Portland Aachen 



Kiel Curry System (Hanus (ed.) et al. 20031 since April 2001. Our partial evalua- 



tor constructs optimized, residual versions for selected calls of the input program. 
These calls are annotated by means of the function PEVAL which is equivalent to 
the identity function. Let us show a typical session with the partial evaluator. Here 
we consider the optimization of a program containing several calls to higher-order 
functions (since it is common to use higher-order combinators such as map, f oldr, 
etc. in Curry programs). Although the use of such functions makes programs con- 
cise, some overhead is introduced at run time. Hence, we apply our partial evaluator 
to optimize calls to these functions. As a concrete example, consider the following 
(annotated) Curry program: 5 

main xs ys = (PEVAL (map (iter (+1) 2) xs)) ++ ys 

iter f n = if n==0 then f else iter (comp f f) (n-1) 
comp f g x = f (g x) 

bench = main [1.. 20000] [] 



5 Here we follow the Curry syntax: both variables and functions (except for PEVAL) start with 
lower case letters and function application is denoted by juxtaposition. 
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stored in the file map_iter . curry. Function comp is a higher-order function to com- 
pose two input functions, while iter composes a given function 2 n times. Thus, 
given two input lists, xs and ys, function main adds 4 to each element of xs — the 
annotated expression — and then concatenates the result with the second list ys. 
The built-in function "++" denotes list concatenation in Curry (more details can 



be found in (Hanus (ed.) 2003 1). In order to measure the improvement achieved by 
the process, we have also included the function bench with a simple call to function 
main, where [1. .20000] represents a list from 1 to 20000. First, we load the pro- 
gram into PAKCS, turn on the time mode (to obtain the run time of computations), 
and execute function bench: 

prelude> :1 map_iter 

compiled /tmp/map_iter .pi in module user, 620 msec 9888 bytes 
map_iter> :set +time 
map_iter> bench 
Runtime: 750 msec. 

Result: [5,6,7,8,9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19,20,21 ,22, .. .] 

Now, we run the partial evaluation tool and show the result of the process: 
map_iter> :peval 

Writing specialized program into "map_iter_pe .f lc" . . . 
Loading partially evaluated program "map_iter_pe" . . . 
map_iter_pe> : show 
main xs ys = (map_pe0 xs) ++ ys 

iter f n = if n==0 then f else iter (comp f f) (n-1) 
comp f g x = f (g x) 

bench = main [1.. 20000] [] 

map_pe0 [] = [] 

map_pe0 (x : xs) = ( ( ( (x + 1) + 1) + 1) + 1) : map_pe0 xs 

Only two modifications have been performed over the original program: the anno- 
tated expression has been replaced by a call to the new function map_pe0 and the 
residual (first-order) definition of map_pe0 has been added. In order to check the 
improvement achieved, we can run function bench again: 

map_iter_pe> bench 
Runtime: 170 msec. 

Result: [5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14, 15 , 16 , 17, 18, 19 ,20 ,21 ,22 , . . . ] 

Thus, the new program runs approximately 4.5 times faster than the original one. 
The reason is that it has a first-order definition and is completely "deforested" 
(Wadlcr 1990) in contrast to the original definition. In fact, the most successful 
experiences were achieved by specializing calls involving higher-order functions (ob- 
taining speedups up to a factor of 9) and generic functions with some static data, 
like a string pattern matcher where a speedup of 14 was obtained; experimental 
results can be found in I) Albert et al. 2002 ) . 
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Note that all aforementioned proposals rely on the theoretical foundations pre- 
sented in this work. Therefore, our results constitute the basis for the correctness 
of all these developments. 

7 Conclusions 

Few attempts have been made to investigate powerful and effective PE techniques 
which can be applied to term rewriting systems, logic programs and functional 
programs. In this work, we have introduced the theoretical basis for the PE of 
functional logic programs based on needed narrowing. We have proved its strong 
correctness, i.e., that the answers computed by needed narrowing in the original 
and specialized programs for the considered goals are identical (up to renaming). 
Furthermore, we have proved that the PE process keeps the inductively sequential 
structure of programs so that the needed narrowing strategy can also be used for 
the execution of specialized programs. As a consequence, our PE process preserves 
the following desirable property for functional logic programs: deterministic evalu- 
ations w.r.t. the original program are still deterministic in the specialized program. 
This property is nontrivial as witnessed by counterexamples for the case of lazy 
narrowing. This allows us to conclude that PE based on needed narrowing provides 
the best known basis for specializing functional logic programs. 

To summarize, the notions presented in this article seem to be the most promising 
approach for the PE of modern functional logic languages based on a lazy semantics: 

• We have shown that a partial evaluator based on lazy narrowing may lead 
from orthogonal programs to programs outside this class. This is clearly im- 
proved by PE based on needed narrowing as it preserves the original (induc- 
tively sequential) structure of programs, which is the only requirement for the 
completeness of the method. 

• On the other hand, modern functional logic languages are based on (some form 
of) needed narrowing and, thus, this article is intended to be the foundational 
work in this area. 

Finally, as we mentioned before, current approaches to the PE of multi-paradigm 
functional logic languages IjAlbert et al. 19991 lAlbert et al. 2002)) rely on the theo- 
retical foundations presented in this work. Therefore, our results provide the necess- 
ary basis for the correctness of all these subsequent developments. 
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Appendix A Inductive Sequentiality of NN-PE 

In this section we proof Theorem |21 which states that partially evaluated programs 
are inductively sequential if the input programs have the same property. Firstly, 
this is only proved for PE w.r.t. linear patterns and, then, we extend this result to 
arbitrary sets of terms. 

Theorem 6 

Let TZ be an inductively sequential program and t be a linear pattern. If TZ' is a 
pre-NN-PE of t in 71, then TZ' is inductively sequential. 

Proof 

Due to the definition of pre-NN-PE, TZ' has the form 
ax(t) -> h 

a n (t) — * t n 

where t ~>+. ti, i = 1, . . . , n, are all the derivations in the needed narrowing tree for 
t ending in a non- failing leaf. To show the inductive sequentiality of TZ' , it suffices 
to show that there exists a definitional tree for the set S = {ai(t), . . . , cr„(£)} with 
pattern f(x^) if t has the p-ary function / at the root. We prove this property by 
induction on the number of inner nodes of the narrowing tree for t. 
Base case: If the number of inner nodes is 1, we first construct a definitional tree 
for the set S = {t} containing only the pattern at the root of the narrowing tree. 
This is always possible by Proposition Now we construct a definitional tree for 
the sons of the root by extending this initial definitional tree. This construction is 
identical to the induction step. 

Induction step: Assume that s is a leaf in the narrowing tree, a is the accumulated 
substitution from the root to this leaf, and V is a definitional tree for the set 

S = {0(t) | t s' is a derivation in the needed narrowing tree 
with a non-failing leaf s'}- 

Now we extend the narrowing tree by applying one needed narrowing step to s, i.e., 
let 

s "^ Vl Si 

be all needed narrowing steps for s. For the induction step, it is sufficient to show 
that there exists a definitional tree for 

S' = (S\{a(t)}) U Wi(a(t)), <p m W))Y 

Consider for each needed narrowing step s ^> Vi .% the associated canonical repre- 
sentation (j>, R, ipik, ° ••• ° ^ii) S A(s,7- > s ) (where V s is a definitional tree for the 
root of s). Let 

V' = V U {ip i:l ■ o ■ • • o tp n o a (t) | 1 < i < m, 1 < j < k} 
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We prove that V' is a definitional tree for S' by showing that each of the four 
properties of a definitional tree holds for V' . 

Root property: The minimum elements are identical for both definitional trees, i.e., 
pattern(V) — pattern(V'), since only instances of a leaf of V are added in V' . 

Leaves property: The set of maximal elements of V is S . Since all substitutions com- 
puted by needed narrowing along different derivations are independent (Lemma 
|y, a is independent to all other substitutions occurring in S and the substitu- 
tions . . . , tp m are pairwise independent. Thus, the replacement of the element 
a(t) in S by the set {<£>i(er(f)), . . . , (p m (a(t))} does not introduce any compara- 
ble (w.r.t. the subsumption ordering) terms. This implies that S' is the set of 
maximal elements of V' . 

Parent property: Let tt G P'\{pattern(P')} . We consider two cases for tt: 

1. tt G V: Then the parent property trivially holds since only instances of a 
leaf of V are added in V' . 

2. tt g" V: By definition of V' , tt = tpij o • • • o tpn o a(t) for some 1 < i < m 
and 1 < j < ki . We show by induction on j that the parent property holds 
for TT. 

Base case (j = 1): Then tt = <^ji(<r(f)). It is ipn ^ id (otherwise tt = 
cr(t) G V). Thus, by Proposition <p%\ — {x c(a^")} with a; G Var(s) C 
Var(a(<)). Due to the linearity of the initial pattern and all substituted 
terms (cf. Proposition^, o~(t) has a single occurrence o of the variable x 
and, therefore, tt — o~(t)[c(x^)] , i.e., tr(i) is the unique parent of tt. 
Induction step (j > 1): We assume that the parent property holds for tt' — 
fi.j-i °" ' 'O^ii oa{i). Let 7^ id (otherwise the induction step is trivial). 
By Proposition [3 tpij — {x 1— > c(x^)} with x G Var{<pij-i o • ■ • o o a{t)) 
(since Var(s) C Var(cr(f))). Now we proceed as in the base case to show 
that tt' is the unique parent of tt. 

Induction property: Let tt G V'\S' . We consider two cases for tt: 

1. tt G 'P\{cr(£)}: Then the induction property holds for tt since it already 
holds in V and only instances of a(t) are added in V' . 

2. tt = ifij o • ■ • o o o~(t) for some 1 < i < m and < j < h. Assume 
Vij+i 7^ *^ (otherwise, do the identical proof with the representation tt = 
Pij+i ' ' ■° ( Pii°o~(t)). By Proposition^ ifiij+i — {x 1— > c(o^)} and 7r has a 
single occurrence of the variable a; (due to the linearity of the initial pattern 
and all substituted terms). Therefore, tt' — fi.j+i o • • • o ipn o a(t) is a child 
of 7r. Consider another child tt" = ifi/ji o- ■ •o^/ 1 ocr(f) of tt (other patterns 
in V' cannot be children of tt due to the induction property for V) . Assume 
(fiiji o • • • o ip^i ^ tpij+i o • • • o (otherwise, both children are identical). 
By Lemma ^ there exists some I with yjj/j o • • • o ip iri = ip a o • • • o ip ill 
<Pi> t l+l = {a;' 1 * c'(---)}, and <^j+i = {a;' i-> c"(- ■ •)} with c' 7^ c". 
Since 7r" and 7r' are children of 7r (i.e., immediate successors w.r.t. the 
subsumption ordering), it must be x' = x (otherwise, tt' differs from tt 
at more than one position) and (pi> t j> — ■■■ — <Pi>,i+2 — id (otherwise, 
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7r" differs from tt at more than one position). Thus, 7r' and 7r" differ only 
in the instantiation of the variable x which has exactly one occurrence in 
their common parent it, i.e., there is a position o of tt with n\ — x and 
7r' = it[c' (x n i)] and ir" = w[c" (x n ")] . Since 7r" was an arbitrary child of 
7r, the induction property holds. 

□ 

Since actual partial evaluations are usually computed for more than one term, we 
extend the previous theorem to this more general case. 

Corollary 2 

Let TZ be an inductively sequential program and S be a finite set of linear patterns 
with pairwise different root symbols. If TV is a pre-NN-PE of S in TZ, then TZ' is 
inductively sequential. 

Proof 

This is a consequence of Theorem since we can construct a definitional tree for 
each pre-NN-PE of a pattern in S. Since all patterns have different root symbols, 
the roots of these definitional trees do not overlap. □ 

Now we are able to show that the NN-PE of an arbitrary set of terms — w.r.t. an in- 
ductively sequential program — always produces an inductively sequential program. 

Theorem^ 

Let TZ be an inductively sequential program and S a finite set of operation-rooted 
terms. Then each NN-PE of 1Z w.r.t. S is inductively sequential. 

Proof 

Let 1Z' be a pre-NN-PE of 1Z w.r.t. S and p an independent renaming of S. Then 
each rule of a NN-PE 1Z" of 1Z w.r.t. S (under p) has the form 9(p(s)) — > ren p {r) 
for some rule 6{s) — > r € 7Z' . Consider the extended rewrite system 

TZ P = TZU {p(s) -»• s | s e S} 

where the renaming p is encoded by a set of rewrite rules. Note that 7Z P is inductively 
sequential since the new left-hand sides p(s) are of the form/ s (av^) with new function 
symbols f s . 

Let TZ' p be an arbitrary pre-NN-PE of 1Z P w.r.t. p(S). Since p(S) is a set of 
linear patterns with pairwise different root symbols, 1Z' is inductively sequential by 
Corollary El It is obvious that each subset of an inductively sequential program is 
also inductively sequential (since only the left-hand sides of the rules are relevant 
for this property). Therefore, to complete the proof it is sufficient to show that all 
left-hand sides of rules from 1Z" can also occur as left-hand sides in some 1Z' . 

Each rule of 1Z" has the form 8(p(s)) — > ren p (r) for some rule 0(s) — > r G TZ! . 
By definition of TV , there exists a needed narrowing derivation s r w.r.t. TZ. 
Hence, 

p{s) ^>id s r 
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is a needed narrowing derivation w.r.t. TZ p . Thus, 9(p(s)) — > r is a resultant which 
can occur in some TZ' p . □ 

Appendix B Strong Correctness of NN-PE 

In this section, we prove Theorem i.e., the strong correctness of NN-PE, and 
introduce some necessary auxiliary notions and results for this proof. The proof 
proceeds essentially as follows. Firstly, we prove the soundness (resp. completeness) 
of the transformation, i.e., we prove that for each answer computed by needed 
narrowing in the original (resp. specialized) program there exists a more general 
answer in the specialized (resp. original) program for the considered queries. Then, 
by using the minimality of needed narrowing, we conclude the strong correctness 
of NN-PE, i.e., the answers computed in the original and the partially evaluated 
programs coincide (up to renaming). 

In order to simplify the proofs, we assume (without loss of generality) that the 
rules of strict equality are automatically added to the original as well as to the 
partially evaluated program. We also assume that the set of specialized terms always 
contains the calls x ~ y and x Ay, and by abuse we consider that p does not modify 
these symbols. This allows us to handle the strict equality rules in TZ' as ordinary 
resultants derived from the one-step needed narrowing derivations for the calls 
x « y and x A y in 1Z. 

B.l Soundness 

The following lemmata are auxiliary to prove that reduction sequences in the spe- 
cialized program can also be performed in the original program (up to renaming of 
terms and programs). 

Lemma 3 

Let TZ be an inductively sequential program and s be an operation-rooted term. 
Let s ^+ r be a needed narrowing derivation w.r.t. TZ whose associated resultant 
is R = (ct(s) — > r). If t — > p ,fl t 1 for some position p £ Vos(t), then t t 1 w.r.t. 
TZ. 

Proof 

Given the derivation s ^+ r, by soundness of needed narrowing (claim 1 of The- 
orem we have cr(s) — > + r. Since t —>p,u t', there exists a substitution such 
that 9(a(s)) = t\ p and t' — t[6(r)] p . Since a(s) r, by stability of rewriting, we 
have 6(a(s)) ^ + 9(r). Therefore t = t[6(a(s))] p ^+ t[9(r)] p = t' w.r.t. TZ, which 
concludes the proof. □ 

Lemma 4 

Let S be a finite set of terms and p an independent renaming for S . Let R = (0(s) — * 
r) be a rewrite rule such that 9 is constructor and s £ S, and let R' = (I' — * r') 
be a renaming of R where /' = 9(p(s)) and r' = ren p (r). Given a term t\ and 
one of its renamings t[ — ren p {ti), if t[ —* p ' ri t' 2 then t\ —>p,R h, where p is the 
corresponding position of p' in t[ and t' 2 — ren p (t,2). 
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Proof 

Immediate by definition of ren p . □ 

The following proposition is the key to prove the soundness of NN-PE. 
Proposition 5 

Let 72. be an inductively sequential program. Let e be an equation, S a finite set 
of operation-rooted terms, p an independent renaming of S, and 72/ a NN-PE of 
72 w.r.t. S (under p) such that 72/ U {e'} is S"-closed, where e' — ren p (e) and 
S' = p(S). If e' — >* true in 72/ then e — >* true in 72.. 

Proof 

We prove the claim by induction on the number n of rewrite steps in e' — >* true 
(considering e' an arbitrary 5"-closed expression). 

Base case. If n = 0, we have e' = true and the claim trivially follows since 
ren p (true) — true by definition. 

Inductive case. Consider a rewrite sequence of the form e' —> p \r> h' — >* true with 
R' = (I' —> r'). By definition of NN-PE, R' has been obtained by applying the 
post-processing renaming to a rule R = (9(s) — > r) in the pre-NN-PE, where 9 
is constructor, I 1 — 9{p(s)), and r' — ren p (r). By Lemma0] we have e ^ p ,_r h 
where p is the corresponding position of p' in e' and /i' = ren p (h). By definition of 
pre-NN-PE, there exists a needed narrowing derivation s r which produced 
the resultant R. Since e ^ p ,r h, we have e h in 72. by Lemma |3 
Since the terms in S' are linear and 72.' is 5"-closed, h! is trivially S"-closed. By 
applying the inductive hypothesis to the subderivation h! — >* true in 72.', there 
exists a sequence h — >* true in 72.. Together with the initial sequence e h we 
get the desired derivation in 72.. 

□ 

Now we state and prove the soundness of NN-PE. 
Theorem 7 

Let 72 be an inductively sequential program. Let e be an equation, V 3 Var(e) a 
finite set of variables, S a finite set of operation-rooted terms, and p an independent 
renaming of S. Let 72/ be a NN-PE of 72 w.r.t. S (under p) such that 72' U {e'} is 
S"-closed, where e' = ren p {e) and S' = p(S). If e' £rwe is a needed narrowing 
derivation for e' in 72.', then there exists a needed narrowing derivation e irue 
in 72 such that (a < cr')[V]. 

Proof 

Since e' , true in 72/ and 72/ is inductively sequential (Theorem [3J , by the 
soundness of needed narrowing (claim 1 of Theorem 0, we have cr'(e') ^* true. 
Since e' is S"-closed and a' is constructor, by definition of closcdncss, cr'(e') is also 
5'-closed and cr'(e') = ren p (a' (e)) . By Proposition^ there exists a rewrite sequence 
o~'(e) — >* true in 72. Therefore, by the completeness of needed narrowing (claim 2 
of Theorem ^1 , there exists a needed narrowing derivation e -^>* true in 72. such 
that (a < cr') [ V], which completes the proof. □ 
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B.2 Completeness 

Firstly, we consider the notions of descendants and traces. Let A — (t — >„ ) ;_ >r t') 
be a reduction step of some term t into t' at position u with rule I — > r. The set of 
descendants ( |Huet and Levy 1992| ) of a position v oi t by A, denoted v\A, is 

!0 if u = v, 

{v} ifu^v, 
{u ■ p' ■ q | r\ p i = x} if v = u ■ p ■ q and l\ p — x, where x E X- 

The set of traces of a position v of t by A, denoted v\A is 

{{d} if u = v, 

{v} if u ^ w, 

{u • p' • 3 | r|p/ = a;} if u = u • p • q and i| p = x, where x E X- 

The set of descendants of a position u by a reduction sequence B is defined induc- 
tively as follows 

{{v} if B is the null derivation, 

(J w\5" if 5 = B'B", where 5' is the initial step of B. 
wev\B' 

Given a set of positions P, we let P\B = {J peP p\B- The definition of the set of 
traces of a position by a reduction sequence is perfectly analogous. 

A redex s in a term t is root-needed, if s (itself or one of its descendants) is con- 
tracted in every rewrite sequence from t to a root-stable term ( Mi ddeldorp 1997| ) . 

In the remainder of this section, we consider outermost-needed reduction se- 
quences as defined 6 in flAntoy et al. 2 000). 

Definition 8 \Antoy 199^ 

Let 1Z be an inductively sequential program. The (partial) function ip takes argu- 
ments t = f(t) for a given / 6 J 7 , and a definitional tree 7 V such that pattern(T) < 
t, and yields a redex occurrence p S 'Pos-iz(t) called an outermost-needed redex: 

A ifV= {tt} 

ip(t, Vi) if V = branch(n, p, Vi, ■ ■ ■ , V n ) 

and pattern{Vi) < t for some i, 1 < i < n 
p ■ <p(t\ p ,Vg) if V = branch(7r,p,Vi, . . . ,V n ), 
root(t\ p ) = g G and 
V g is a definitional tree for g- 



<p(t,V) = < 



The following notations and terminology are needed for the subsequent develop- 
ments. Positions u, v are disjoint, denoted u _L v, if neither u < v nor v < u. 
Also, for a set of (pairwise disjoint, ordered) positions P = {pi, . . . , p n }, we let 



" This is a slightly different though equivalent definition, since we do not allow for exempt nodes, 
as in |Antoy 1992} . 

7 In this definition, we write branch(-K, p, "Pi, . . . , Vn) for a definitional tree V with pattern tt if tt is 
a branch node with inductive position p and children ni, . . . , n n where Vi = {tt 1 & V \ tti < tt 1 }, 
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t[si, . . . , s n ]p = (({t[si] Pl )[s2)p 2 ) ■ . • [sn]p,J- A term is root-normalizing if it has a 
root-stable reduct. If there are rules I — > r and /' — ► r' and a most general unifier 
a for Z| p and I' for some position p, the pair (c(Z)[<T(r')]p, u(r)) is called a critical 
pair and it is also called an overlay if p = A. A critical pair {t, s) is trivial if t = s. 
A TRS is called almost orthogonal if its critical pairs are trivial overlays. If all crit- 
ical pairs are trivial, a TRS is called weakly orthogonal. Note that, in CB-TRSs, 
almost orthogonality and weak orthogonality coincide. The inner reduction relation 
is -^>a = — >\—>A- The following technical results are auxiliary. 

Lemma 5 \Middeldorp 199T[ 

Let TZ be an almost orthogonal TRS. If t is root-stable and s-^> A i, then s is 
root-stable. 

Theorem 8 

Let TZ be an inductively sequential program and t be a non-root-stable term. Every 
outermost-needed redex is root-needed. 

Proof 

By Theorem 18 in ( Ha nus et al. 1998^1 . outermost- needed redexes are addressed by 
strong indices. By Theorem 5.6 in IjLucas 1998(1 . nv-indices (hence strong indices, 
see ( |Oyamaguchi 19 93)) in non-root-stable terms address root-needed redexes. □ 

Theorem 9 

Let TZ be a weakly orthogonal CB-TRS and t be a term. Let P = {pi , . . . , p n } C 
Vos(t) be a set of disjoint positions of t such that each t\ Pi for 1 < i < n is 
operation-rooted. If t admits a root-normalizing derivation which does not root- 
normalize any t\ Pi , then t admits a root-normalizing derivation which does not 
reduce any t\ Pi . 

Proof 

If t is root-stable, the result is immediate. If t is not root-stable, then there exists a 
root-stable reduct a(r) and a derivation A : t ^* a (I) c(r) for some rule I — > r 
in TZ which, by hypothesis, root-normalizes t without root-normalizing any t\ Pi . Let 
= y\ , . . . , y n be new, distinct variables each of which is used to name a subterm 
t\ Pi . The substitution 9 t defined by 9t{yi) — t\ Pi associates a subterm to each 
variable. Note that O t (t[y^\p) — t. As an intermediate step of the demonstration, 
first we prove, by induction on the length N + 1 of the derivation A, that there 
exists a substitution a' such that t[y^]p cr'(l) and 9 t (a'(x)) a(x) for all 
x E Var(l). 

First we note that, since the derivation A does not root-normalize any t\ Pi , we 
have that pi > A for every 1 < i < n (otherwise P — {A} and we obtain a 
contradiction with the initial hypothesis). 

1. If N = 0, then t — cr(l). Since pi > A, t\ Pi is operation-rooted for 1 < i < n, and 
TZ is constructor based, then for each pi there exists a variable position Vi G 'Pos(l) 
such that pi — Vi ■ Wi and l\ Vi is a variable. Then, for each x G Var(l), we let 
<t'(x) — t\y^]p\ Vx where v x is the position of x in /. Hence, t\y^\p = <j'{1) and 
6 t (a'(x)) = t (t[y^]p\ v J = a(x) for each x E Var(l). Thus, 9 t (a'(x)) ->* a(x). 
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2. If N > 0, then we consider the derivation t —* g t' — ►* a (I). Let P' = q\P = 
{p[, . . . , p' n , } be the traces of P w.r.t. the rewriting step t — >- g t' (note that the traces 
are well defined since every t\ Pi is operation-rooted). By hypothesis, the derivation 
t — ►* a (I) — > <r(r) does not root-normalize any t\ p for p G P. In particular, the step 
t — f g t' does not root-normalize any t\ p for p E P. Therefore, each t'\ p i for every 
p' G P' is operation-rooted and the derivation t' — ►* a{l) — » a(r) does not root- 
normalize any t'\ p i for p' G P'. Thus, by the induction hypothesis, t'[zn']p' — ►* o~'(l) 
and 9 t i(a'{x)) — ►* a(x) for all x G Var(l) where = • • • , z n > are new, distinct 
variables which identify the subterms in t' addressed by P', i.e., 9 t '{zi) = t'\ p i 
for 1 < i < n'. We connect variables in and variables in by means of a 
substitution r : z^T — > tfo as follows: r(zj) = j/j iff p£ is a trace of (w.r.t. the step 
t — > f') for 1 < i < n' and 1 < j < n. Now we consider two cases: 

(a) If there is no p G P such that p < q, then, since each t\ Pi is operation-rooted 
and 1Z is constructor-based, we have that t[y^]p — > q T(t'[z n ']pi). Moreover, 
since no t\ Pl changes in this rewriting step, we have 0t{r{z)) = 9 t >(z) for all 
z G ~z^7, i.e., 9 t ' — 9 t or. Since t'\z^i\pi — ►* cr'(l), by stability, r(t'[z^/]p/) — >* 
t(ct'(0). Thus, -+* r((r'(0). Since 9 v {a'{x)) = 9 t ( T {a'(x))) -+* a(x), 
the conclusion follows. 

(b) If there is p G P such that p < q, then P' = P , n = n' and we can take = 
z^7. Hence, t[y^] P = t'[y^] P = t'[z^f\ P >. Now we have that 6t{Vi) — > 0t'(#») 
if p = pi, for some 1 < i < n whereas 9t(yj) = ^t'(Vj) for all j ^ i, and the 
conclusion also follows. 

Since Q t (o'{x)) <r(x) for all x G Var(l), we consider two possibilities: 

1. If r ^ A", then, since Var(r) C Var{l), we have that 9 t (a'(r))— *%&o-(r). 

2. If r = x G A", we prove that this implies that t (a'(r)) = 6t(o-'(x))— >-t A cr(a;) = 
<r(r). Otherwise, it is necessary that cr'(:r) be a variable. In this case, it must be 
o-'(x) = yi for some 1 < i < n (otherwise, 9 t (cr'(x)) — cr'(x) is a variable and 
it cannot be rewritten to cr(x) in zero or more steps unless <y'{x) — o{x) in which 
case, we trivially have that t (a'(x))— >>\0-(x)). Since a(r) = a(x) is root-stable, the 
existence of the derivation 9t(o~'(x)) = 9 t {yi) — >* cr(x) implies (since each reduction 
step in the derivation 9 t (a'(x)) — >* <r(x) has been taken from the derivation A) that 
the derivation A root-normalizes the subterm 9t(yi) — t\pf This contradicts our 
initial hypothesis. 

Thus, in all cases, we have that 9t(o-'(r))—>t >A a(r) and, since a(r) is root-stable, by 
LemmaEl (remember that weak orthogonality and almost orthogonality coincide for 
CB-TRSs), 9 t {<j'{r)) is root-stable. Note that we have also proved that t[y^] p — >* 
cr'(l) — > er'(r) and therefore, by stability, t — 9 t {t\y^\p) ^* 9 t (a'(r)) is a root- 
normalizing derivation for t which does not reduce any t\ Pi for 1 < i < n. □ 

Theorem 19 

Let 7Z be a weakly orthogonal CB-TRS and t be a term. Let P = {pi , . . . , p n } C 
Pos(t) be a set of disjoint positions of t such that each t\ Pi for 1 < i < n is a 
root-stable, operation-rooted term. If t is root-normalizing, then t admits a root- 
normalizing derivation which does not reduce any t\ Pi . 
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Proof 

The proof is perfectly analogous to the proof of Theorem Assume the same 
notations for the proof. Only one difference arises in the last part of the proof: we 
need not distinguish the cases r G X and r ^ X. This is because the fact that each 
t\ Pi is root-stable and the fact that 0t(cr'(x)) ^* cr(x) for all x G Var(l) allows 
us to immediately derive that 0t(a'(x))— >y A a(x). Now, it suffices to consider that 
every t\ Pi and their possible reducts are operation-rooted, which easily follows from 
the fact that each t\ Pi is root-stable and operation-rooted. □ 

Theorem 11 

Let 7Z be a weakly orthogonal CB-TRS, t be a term, and p G Vos(t). Let s be a 
root-stable, operation-rooted subterm of t. If t is root-normalizing, then s does not 
have redexes which are root-needed in t. 

Proof 

Immediate, by using Theorem 1101 □ 
Theorem 12 

Let 1Z be a weakly orthogonal CB-TRS and t be a term. If s is an operation-rooted 
subterm of t that contains a redex which is root-needed in t, then every root-needed 
redex in s is root-needed in t. 

Proof 

Since t contains at least a root-needed redex, then t is not root-stable. If t has 
no root-stable form, it is trivial, since every redex is root-needed in t. Hence, we 
assume that t has a root-stable reduct. Let s\ q be a root-needed redex in s. Then, 
s is not root-stable. Let s| 9 < be a root-needed redex in t. If s\ q is not root-needed in 
t, then it is possible to root-normalize t without reducing the redex s\ q . However, 
without reducing the redex s\ q it is not possible to root-normalize s. Therefore, it is 
possible to root-normalize t without root-normalize s. By Theorem^ it is possible 
to root-normalize t without reducing s, hence without reducing s| 9 <, which yields 
a contradiction. □ 

The following auxiliary definition is useful to deal with closed terms (it is a slight 
refinement of the same notion in ( |Alpuente et al. 1998a| )). 

Definition 9 {covering set, closure set) 

Let S be a finite set of terms and t be an S'-closed term. We define the covering set 
of t w.r.t. S as follows: 

CSet{S, t) = {0\0 G c.set(S, t), (u ■ OJail) £ 0,ue IN*} 

where the auxiliary function c_sei, used to compute each closure set O, is defined 
inductively as follows: 

c_set(S, t) 3 

'0 if t G X U C, 

U"=i{(* " Pi s ) I (Pi s ) e cset(S, ti)} if t = c(tn), c G C* , 

< {(A, s)} U {(q ■ p, s') | s\ q G X, (p, s') G c.set{S, 0{s\ q ))} if 3s G S s.t. 0(s) = t 

for some 9, 

k {(0,/aiZ)} otherwise. 
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where C* = C U {«, A}. Note that positions ending with the mark "0" identify the 
situation in which some subexpression of t is not an instance of any of the terms in 
S. Thus, a set containing a pair of the form (u ■ 0,fail) is not considered a closure 
set. 

Roughly speaking, given a set of terms S and a term t which is S'-closed, each 
set in CSet(S, t) identifies a concrete way in which t can be proved S'-closed, thus 
avoiding the non-determinism which is implicit in the definition of closedness. 

The following lifting lemma is a slight variant of the completeness result for 
needed narrowing. 

Lemma 6 

Let 1Z be an inductively sequential program. Let a be a constructor substitution, 
V a finite set of variables, and s an operation-rooted term with Var(s) C V. If 
c(s) — ^■ p1i r 1 ■■■ — > P „,R n t is an outermost- needed reduction sequence, then there 
exists a needed narrowing derivation s ^ ^.ffj • • • ^ Pnt R nt a n t' and a constructor 
substitution a' such that o'{t') — t and (a' o <j n o • • • o a 1 = a) [V]. 

Proof 

It is perfectly analogous to the proof of Theorem 4 (completeness) in ( |Antoy et al. 2 000 ) . 
□ 

Now we prove two technical results which are necessary for a useful generalization of 
the lifting lemma. We need to make the lemma applicable even when the considered 
substitution is not constructor, as long as it still does not introduce a needed redex. 
In order to do this extension, we need to ensure that it is possible to get rid of 
some operation-rooted subterms which are introduced by instantiation whenever 
they are not contracted in the considered derivation. We prove this in the following 
lemmata. 

Lemma 7 

Let 1Z be program. Let t and s be operation-rooted terms and Po Q Vos(t) be a 
nonempty set of disjoint positions such that t\ p — s for all p £ Po- Let 

t[s, . . . , s]p = t — ^pj^j • • • —>p ni R n t n — t'[s, . . . , s]p n 

be a reduction sequence where Ai — — * Pi ^ U) and Pj = Pj-^ylj for all 
i = 1, . . . , n, n > 0. If p ^ pi for all p e Pi— i, i = 1, . . . , n, then there exists a 
reduction sequence 

t[x, . . . , x] Po ^ Pu R 1 ■ ■ ■ -^p n ,R. n t'[x, . . . , x} Pn 

Proof 

By induction on the number n of steps in the former reduction: 
n = 0. Trivial. 

n > 0. Consider A± = (t[s, . . . , s]p — > P1j .r 1 t"[s, . . . , s]p 1 ), where Pi = (h — > r±), 
o~i{h) = t\ Pl , and Pi = P \Ai. We distinguish two cases depending on the 
relative position of p\ (the case p < pi, for some p 6 Po, is not considered since 
the subterms in s are not contracted, i.e., p i^p\ for all p £ Po): 
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Vp £ Pq ■ pi _L p. In this case, we have that <T\{li) = {t[x, . . . , s]p )| Pl and, by 
definition of descendant, Pq = P\. Therefore t[x, . . . , x]p a —*p u R y t"[x, . . . , x]p 
and the claim follows by applying the inductive hypothesis to the sequence 

B = (t"[s, . . . , s]p -> P2 ,r 2 > Pn ,R n t'[s, . . . , s]pJ, 

where P n — Pq\B. 

3p £ Pq ■ pi < p. Since s is operation- rooted and l± is a linear pattern, then 
there exists a substitution a[ such that <j'i(h) = (t[x, . . . ,x]p )\ Pl (i.e., {a; i— » 
s} o a[ = <ti). Therefore, the reduction step 

t[x, x] Po ~* Pu r 1 t"[x, x] Pl 

exists and the claim follows by applying the inductive hypothesis to 

□ 

Lemma 8 

Let 1Z be a program. Let 9 — {x\ i— > si, . . . , a; m i— > s m } be an idempotent sub- 
stitution such that s.j is an operation- rooted term for all i — 1, . . . , m. Let s be 
an operation-rooted term and 9(s) — to -^ Pi ,r 1 ■ ■ • — *p n ,fl n £ « be a reduction se- 
quence where >L = — > p ,,p, U) and Pi = Pi_i\Ai, for i = 1, . . . , n, n > 0. 
If P ^ Pi for all p 6 Pi-i, i = 1, . . . , n, then there exists a reduction sequence 

S —tpuR! ■ ■ ■ -^p n ,Rn S ' SUCn tliat ^( S ') = 

Proof 

By induction on the number m of bindings in 9: 

Base case. Consider 9 = {x\ ^> s±}. We have 9(s) = <o[si, . . . , si]p and s = 
tg[xi, . . . , x\]p, where P — {p £ Vos(s) \ s\ p = x\). Then, the claim follows 
directly by Lemma 

Induction step. Consider 6 = Ox U 9', where 9\ = {xi i— > si} and 9' — {x2 i— > 
s 2 , . . . , z m «m}- Then, 0(s) = <b[si, . . . , si]p and 0'(s) = fo^, . . . , Xi] P , where 
P = {p G Pos(s) | s|p = xi}. Applying Lemma we have that 

io[xi, . . . , ii]p -^ Plt R 1 ■ ■ ■ — > p „,.r„ s 

is a reduction sequence such that 0i(s") = t n . By applying the inductive hy- 
pothesis to this derivation, we have that s — • ■ • ~^p n ,R n s' is a reduction 
sequence such that 9'(s') = s" . Therefore, since T>om(9\) C\T>om(9') = 0, we get 
9(s') = (9\ o 9')(s') = 9\{s") = t„,, which proves the claim. 

□ 

Now we are ready to extend the lifting lemma for needed narrowing (Lemma [SJl to 
non-constructor substitutions which do not introduce needed redexes. 



B — (t [s, . . . , s]p 1 ^ P2 ,r 2 ■ ■ ■ 
where P n = PAB. 
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Theorem 13 

Let 1Z be an inductively sequential program. Let a be a substitution and V a 
finite set of variables. Let s be an operation-rooted term and Var(s) C V. Let 
o~(s) —*-p 1 ,R 1 ■ ■ ■ —>p n ,R n t be an outermost-needed rewrite sequence such that, for 
all root-needed redex a(s)\ p of cr(s), p G AfVPos(s). Then, there exists a needed 
narrowing derivation s ■^ Pi .r 1 ^ 1 • ■ ■ ^p„,fl„,<r„ i' and a substitution a' such that 
cr'(i') = £ and (a' o a n o • • • o a% = a) [V]. 

Proof 

We consider two cases: 

a is a constructor substitution. In this case, the claim follows directly by applying 
Lemma and a' is a constructor substitution too. 

a is a non-constructor substitution. Then, there exist substitutions 9\ and 02 such 
that a — 02 ° &i, the substitution Q\ is constructor, and for all x ^ s' G 02, 
s' is operation-rooted. Then cr(s) = ^2(^1(5)). By applying Lemma|Hl we have 
0i(s) — ■■• — ^pn.iin s" such that 02{s") = t. On the other hand, since 
a does not introduce root-needed redexes (i.e., if <j(s)\ p is a root-needed redex 
then p G NVV os(s)), then the sequence is an outermost-needed derivation. Now, 
applying Lemma HO to this reduction sequence, there exists a needed narrowing 
derivation s ^■ plt R lt a-i •*■ ^p„,R n ,a n t' and a constructor substitution cr" such 
that a"(t') = s" and (a" o cr n o • • • o o\ = By taking cr' = 2 o cr", 

we have a'(t') = (6» 2 o cr")(i') = 6» 2 (cr"(i')) = 9 2 {s") = t. Finally, since cr" o 
cr n o • • • o (j\ = Oi)[V], we have (0 2 o a" o <j n o • ■ ■ o a\ — 02 0i[V], and hence 
(cr' o cr„ o • • • o o\ — cr) [ V] , which completes the proof. 

□ 

The next lemma establishes a strong correspondence between the closedness of an 
expression t and that of one renaming of t. 

Lemma 9 

Let S be a finite set of terms, p an independent renaming of S, and S' = p(S). 
Given a term t, ren p (t) is 5"-closed iff t is 5-closed. 

Proof 

By induction on the structure of the terms. □ 

The following lemma states that, if some term t has an operation-rooted subtcrm 
s that contains a redex which is root-needed in t, then the outermost-needed redex 
in s is also root-needed in t. 

Lemma 10 

Let 1Z be an inductively sequential program and t be a term. If s is an operation- 
rooted subterm of t which contains a root-needed redex in t, then every outermost- 
needed redex in s is root- needed in t. 
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Proof 

Since t contains at least a root-needed redex, t is not root-stable. If t has no root- 
stable form, then every redex in t is root-needed. Therefore, we assume that t has 
a root-stable reduct. If s contains an outermost-needed redex, then, by hypothesis 
and by Theorem ^2 s is not root-stable. Hence, by Theorem [SJ such a redex is 
root-needed in s. By Theorem II 21 the conclusion follows. □ 

The following lemma is helpful. 

Lemma 11 

Let TZ be an inductively sequential program and t be a term. If s is an operation- 
rooted subtcrm of t which contains a root-needed redex in t and there is a sub- 
term s' of s which does not contain any root-needed redex in t, then there is no 
outermost-needed derivation from s to a root-stable form which contracts any redex 
(or residual) in s'. 

Proof 

If s is root-stable, the claim is trivially true. If s is not root-stable and there is 
an outermost-needed derivation starting from s which contracts a (residual of a) 
redex s" in s' , then, by Theorem |H1 such a redex is root-needed in s. Therefore, by 
Theorem II 21 s" is root-needed in t, thus leading to a contradiction. □ 

Proposition 6 

Let TZ be an inductively sequential program. Let e be an equation, S a finite set 
of operation-rooted terms, p an independent renaming of S, and TZ' a NN-PE of 
TZ w.r.t. S (under p) such that TZ' U {e'} is 5"-closed, where e' = ren p (e) and 
S' = p(S). If e — >* true in TZ then e' — ►* true in TV . 

Proof 

Since e' is S"-closed, by Lemma e is 5-closed. Now we prove that, for any re- 
duction sequence e —>* true in TZ for an 5-closed term e (not necessarily an equa- 
tion), there exists a reduction sequence e' — >* true in TZ' with e' — ren p (e). Let 
B\, . . . , Bj be all possible needed reduction sequences from e to true and ki the 
number of contracted redexes in Bi, i = 1, . . . , j. We prove the claim by induction 
on the maximum number n = max{k\ 1 . . . ,kj) of contracted needed redexes which 
are necessary to reduce e to true. 

n = 0. This case is trivial since e' = ren p (true) = true. 

n > 0. Since e is 5-closed, there exists a closure set {(pi, si), . . . , (p m , s m )} 6 
CSet(S, e), m > 0, where pi S Pos(e) and Si 6 5 1 , i = 1, . . . , m. Since e contains 
at least one needed redex, there exists some i G {1, . . . , m} such that e\ Pi = 9{si) 
and the following facts hold: 

• there exists at least one position q £ Pos(e\ Pi ) such that e\ Pi . q is a needed 
redex in e, and 

• for all needed redex e\ Vi . q i in e, we have q' e TVVPos^). 
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Informally, addresses an "innermost" subterm of e (according to the partition 
imposed by the closure set) in the sense that e\ Pi contains at least one needed 
redex and there is no inner subterm e\ P] , pi < Pj, which contains needed redexes. 
Since both e and e\ Pt are operation-rooted terms, by Lemma 1101 we know that 
each outermost-needed redex in e\ Pi is also a needed redex in e (note that, for 
derivations e — ►* true in confluent TRSs, the notions of neededness and root- 
neededness coincide, since true is the only root-stable form of e). Let us assume 
that q\ is the position of such an outermost-needed redex. Since there is no inner 
subterm e\ Pj , pi < Pj, which contains needed redexes in e, then by Lemma |l II 
we have that there is no inner subterm e\ Pj , pi < pj, which contains root-needed 
redexes in e\ Pi . Hence, we can consider a reduction sequence 

such that the corresponding sequence for 9(si) 

@(Si) > q 1 ,Rl ■ ■ ■ > q k -l,Rk-l S i > qk,Rk S i 

is outermost-needed, is root-stable, and s'/ is not root-stable, k > 0. 
Now, we prove that s[ is constructor-rooted. Assume that s[ is operation-rooted. 
Then, since t' — e[s' i \ Pl is root-normalizing, by Theorem ^| there exists a reduc- 
tion sequence t 1 — >* true which does not reduce s[. Since s[ is operation-rooted 
and 1Z is constructor-based, then there exists a reduction sequence e[x] Pi — >* true, 
with x ^ Var(e). Therefore, e — >* true without reducing e\ Pi , which contradicts 
the initial hypothesis that e\ Pi contains a root-needed redex in e. Hence, s[ is 
constructor-rooted. 

Let V be a finite set of variables containing Var(sj). By Theorem 1131 we know 
that there exists a needed narrowing derivation Si -^ gii fl liffl ... ^q k ,R k ,a k s" 
which contracts the same positions using the same rules and in the same order. 
By definition of NN-PE, some resultant of TV derives from a prefix of this needed 
narrowing derivation. Assume that the following subderivation 

Si ~-*q u R u at ■ ■ ■ ^q,,R,,a, t' , < j < k 

is the one which has been used to construct such a resultant. Let a" = a 3 • o • • -oo\. 
Since 0(s{) -^ qi ,R 1 . . . - > o,-,fl< t> again by Theorem 1 181 there exists a substitution 
a' such that a'(t') = t and (a' o a" = 9)[V]. Thus, the considered resultant has 
the form 

Ri = (a"(p( Si )) - ren p {t')) 

and the considered reduction sequence in 1Z has the form 

e = e[6(si)} Pi -> Pl . qi , Rl . . . -^pi-q^Rj e[t] P( ^* true 

Now, we prove that e' can be reduced at position p[ using i?', where p\ is the 
corresponding position of pi in e after renaming. By construction, 6(x) is S'-closed 
for all x £ T>om(6). Moreover, since a" is constructor and (a' o a" — 0)[V\, 
we have that a'(x) is also 5-closed for all x G T>om(a'). Then, there exists 
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a substitution 9' = {x i— » ren p (a'(x)) \ x £ Vom{a')} such that 9'{x) is 5"- 
closed for all x £ T>om(9'). By definition of post-processing renaming, e'\ p i — 
ren p (e\ Pi ) = ren p (9(si)). Since Var(si) — V ar{p{si)) and a" is constructor, we 
have ren p {9( Sl )) = ren p {a' W '(«»)) = 0'(o-"(raip(si))) = 6»'(ct"(p(s 1 ))). Therefore, 
the following rewrite step can be proved 

e'\ pi = 9'{<j"(p{ Sl ))) -+ KR , 9\ren p (t')) = ren p (a'(t')) = ren p (t) 

and thus e' — > p ' t p' e' [ren p (t)] p i . Then, it is immediate to see that e [ren p (t)] p ( 
= ren p (e[t] Pi ). 

Let us now consider the 5-closedness of e[t] Pi . Since TZ' is 5"-closed, ren p (t') 
is also S"-closed. By Lemma |§1 t' is S'-closed. Since cr'(x) is 5-closed for all 
x £ T>om(cr'), by definition of closedness, cr'(t') = t is also 5-closed. Now we 
distinguish two cases: 

Pi = A. Then e[t] Pi is trivially 5-closed since t is 5-closed. 

Pi 7^ A. Let j £ {1, . . . , m} such that pj < pi and there is no k £ {1, . . . , m} 
with pj < pk < Pi- Let e\ Pj = j{sj) where y i— > Si £ 7, and consider the set 
P y = {pj ■ q £ {pi, . . . ,p m } I Sj\ q = y}. Now we have two possibilities: 

P y is a singleton. Then e[t] Pi is trivially ^-closed, since (pi,Si) £ CSet(S,e) 
and t is S'-closed. 

P y is not a singleton. In this case, we have e = e[0(sj), . . . , 0(si)]p . By con- 
sidering again the reduction sequences 9(si) — ►* i for each Sj, we get 

e[0(*),... > e[t,...,t] P , 

and, by definition of closedness, it is immediate to see that e[t, . . . , t]p is 
S'-closed. Moreover, we can construct the following reduction sequence: 

e'[9'(a"(p( Si ))), . . . , 6'(a"{p( Si )))]p' v - • • • ^ e'[ren p (t), ren p {t)] P - 
where P' y corresponds to the positions of P y in e after renaming. Then, we 
have 

e'[ren p (t), . . . , reK p (t)] P ^ = ren p (e[t, . . . , t] Py )- 
Putting all pieces together, we conclude that there exists a reduction sequence 

e e[t, . . . , t]p — >* true 
in TZ, where P = {pi} or P = P y , such that there exists a reduction sequence 

e' ^ + e'[ren p (t), . . . , ren p {t)] P , 

in 72.', where P' = {p^} or P' = P y , respectively. Since e e[t, . . . ,t]p 

has reduced at least one needed redex in e and e'[ren p (t), . . . , ren p (t)]p> = 
ren p (e[t, . . . , £]p), by applying the induction hypothesis to e[t, . . . , £]p — ^* irite 
in TZ, we get 

e'[ren p (t), . . . , ren p (i)]p' — >* inte 
in TZ' . By composing this sequence with the previous sequence 

e' ^ + e'[ren p (t), ren p {t)] P , 
we get the desired result. 
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□ 

The completeness of NN-PE is a direct consequence of the previous proposition and 
the soundness and completeness of needed narrowing. 

Theorem 14 (completeness) 

Let ft be an inductively sequential program. Let e be an equation, V D Var(e) a 
finite set of variables, S a finite set of operation-rooted terms, and p an independent 
renaming of S. Let ft' be a NN-PE of ft w.r.t. S (under p) such that ft' U {e'} is 
5'-closed, where e' = ren p (e) and S' = p(S). If e true is a needed narrowing 
derivation for e in ft, then there exists a needed narrowing derivation e' ^* a , true 
in ft' such that (a' < a)[V]. 

Proof 

Since e trwe, by the soundness of needed narrowing (claim 1 of Theorem 
we have cr(e) — >* true. Since e' is 5"-closed and a is constructor, by definition of 
closedness, er(e') is also S"-closed and cr(e') = ren p (o-(e)). By Proposition^! there 
exists a rewrite sequence cr(e') — >* true in ft' '. Therefore, since a is a solution of e' 
in ft' and 72/ is inductively sequential (Theorem OJ) , by the completeness of needed 
narrowing (claim 2 of Theorem^), there exists a needed narrowing derivation e' ~~>*, 
true such that (er' < a) [ V] . □ 

B.3 Strong Correctness 

Finally, the strong correctness of the transformation can be easily proved as a di- 
rect consequence of Theorems[7]andEl together with the independence of solutions 
computed by needed narrowing. 

Theorem^ (strong correctness) 

Let 1Z be an inductively sequential program. Let e be an equation, V ^ Var(e) a 
finite set of variables, S a finite set of operation-rooted terms, and p an independent 
renaming of S. Let K' be a NN-PE of K w.r.t. S (under p) such that W U{e'} is S'- 
closed, where e' = ren p (e) and S' — p(S). Then, e true is a needed narrowing 
derivation for e in TZ iff there exists a needed narrowing derivation e' true in 
ft' such that (a' = o-)[V] (up to renaming) . 

Proof 

We consider the two directions separately: 

Strong soundness. We prove the claim by contradiction. Assume that there exists 
some substitution a' computed by needed narrowing for e' in ft' such that there 
is no substitution 9 computed by needed narrowing for e in ft with (0 = &')[V\ 
(up to renaming). 

By Theorem[3 (soundness of NN-PE) and the assumption above, we conclude that 
there must be some substitution a computed by needed narrowing for e in ft such 
that [g < o-')[V}. Then, by Theorem ITU there exists a substitution 8' computed 
by needed narrowing for e' in ft' such that (9' < cr)[V]. Since (9' < a)[V] 
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and (<r < <j')[V], we have (6' < a')[V] which contradicts the independence of 
solutions computed by needed narrowing (claim 3 of Theorem 0). 
Strong completeness. The proof is perfectly analogous, by considering the com- 
pleteness of NN-PE (Theorem 114(1 rather than its soundness (Theorem 0). 



□ 



